NUCLEIC ACIDS AND PROTEINS FROM STREPTOCOCCUS PNEUMONIAE

The present invention relates to proteins derived from Streptococcus pneumoniae,
nucleic acid molecules encoding such proteins, the use of the nucleic acid and/or
proteins as antigens/immunogens and in detection/diagnosis, as well as methods for
screening the proteins/nucleic acid sequences as potential anti-microbial targets.

Streptococcus pneumoniae, commonly referred to as the pneumococcus, is an
important pathogenic organism. The continuing significance of Streptococcus pneumoniae
infections in relation to human disease in developing and developed
countries has been authoritatively reviewed (Fiber, G.R., Science, 265: 1385-1387
(1994)). That indicates that on a global scale this organism is believed to be the
most common bacterial cause of acute respiratory infections, and is estimated to
result in 1 million childhood deaths each year, mostly in developing countries
(Stansfield, S.K., Pediatr. Infect. Dis., 6: 622 (1987)). In the USA it has been
suggested (Breiman et al, Arch. Intern. Med., 150: 1401 (1990)) that the
pneumococcus is still the most common cause of bacterial pneumonia, and that
disease rates are particularly high in young children, in the elderly, and in patients
with predisposing conditions such as asplenia, heart, lung and kidney disease,
diabetes, alcoholism, or with immunosuppressive disorders, especially AIDS. These
groups are at higher risk of pneumococcal septicaemia and hence meningitis and
therefore have a greater risk of dying from pneumococcal infection. The
pneumococcus is also the leading cause of otitis media and sinusitis, which remain
prevalent infections in children in developed countries, and which incur substantial
costs.



The need for effective preventative strategies against pneumococcal infection is
highlighted by the recent emergence of penicillin-resistant pneumococci. It has been
reported that 6.6% of pneumococcal isolates in 13 US hospitals in 12 states were found

been used in the development of a vaccine against Haemophilus influenzae, for
instance. There are, however, issues of cost concerning both the
multi-polysaccharide vaccines and those based on conjugates.

A third approach is to look for other antigenic components which offer the potential 
to be vaccine candidates. This is the basis of the present invention. Using a specially 
developed bacterial expression system, we have been able to identify a group of 
protein antigens from pneumococcus which are associated with the bacterial
envelope or which are secreted. 

Thus, in a first aspect the present invention provides a Streptococcus pneumoniae 
protein or polypeptide having a sequence selected from those shown in Table 1.

In a second aspect, the present invention provides a Streptococcus pneumoniae 
protein or polypeptide having a sequence selected from those shown in Table 2.

A protein or polypeptide of the present invention may be provided in substantially pure 
form. For example, it may be provided in a form which is substantially free of other 
proteins. 

As discussed herein, the proteins and polypeptides of the invention are useful as 
antigenic material. Such material can be "antigenic" and/or "immunogenic". 
Generally, "antigenic" is taken to mean that the protein or polypeptide is capable of 
being used to raise antibodies or indeed is capable of inducing an antibody response in 
a subject. "Immunogenic" is taken to mean that the protein or polypeptide is capable of 
eliciting a protective immune response in a subject. Thus, in the latter case, the protein 
or polypeptide may be capable of not only generating an antibody response but, in 
addition, a non-antibody based immune response. 



The skilled person will appreciate that homologues or derivatives of the proteins or 
polypeptides of the invention will also find use in the context of the present invention, 
ie as antigenic/immunogenic material. Thus, for instance proteins or polypeptides 
which include one or more additions, deletions, substitutions or the like are 
encompassed by the present invention. In addition, it may be possible to replace one
amino acid with another of similar "type", for instance replacing one hydrophobic
amino acid with another.

One can use a program such as the CLUSTAL program to compare amino acid 
sequences. This program compares amino acid sequences and finds the optimal 
alignment by inserting spaces in either sequence as appropriate. It is possible to 
calculate amino acid identity or similarity (identity plus conservation of amino acid 
type) for an optimal alignment. A program like BLASTx will align the longest stretch 
of similar sequences and assign a value to the fit. It is thus possible to obtain a 
comparison where several regions of similarity are found, each having a different 
score. Both types of identity analysis are contemplated in the present invention. 

In the case of homologues and derivatives, the degree of identity with a protein or 
polypeptide as described herein is less important than that the homologue or derivative 
should retain the antigenicity or immunogenicity of the original protein or polypeptide. 
However, suitably, homologues or derivatives having at least 60% similarity (as 
discussed above) with the proteins or polypeptides described herein are provided. 
Preferably, homologues or derivatives having at least 70% similarity, more preferably 
at least 80% similarity are provided. Most preferably, homologues or derivatives 
having at least 90% or even 95% similarity are provided. 
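
By way of illustration only, the identity and similarity figures discussed above can be computed from a pair of already aligned sequences along the following lines. This is a minimal sketch and not the CLUSTAL or BLASTx method itself: the grouping of amino acids by "type" in SIMILAR_GROUPS, the function names and the choice to count every non-empty alignment column in the denominator are all assumptions made for the example.

```python
# Assumed grouping of amino acids by "type"; the text does not define one.
SIMILAR_GROUPS = [
    set("AVLIMFWC"),  # hydrophobic (assumed)
    set("STNQGPY"),   # polar, uncharged (assumed)
    set("DE"),        # acidic
    set("KRH"),       # basic
]


def same_type(a, b):
    """True if both residues fall in one of the assumed groups."""
    return any(a in group and b in group for group in SIMILAR_GROUPS)


def identity_and_similarity(aligned_a, aligned_b, gap="-"):
    """Return (percent identity, percent similarity) for two aligned sequences.

    Similarity is identity plus conservation of amino acid type.
    Columns where only one sequence has a gap count towards the
    denominator but never towards identity or similarity.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    identical = similar = compared = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap and b == gap:
            continue  # empty column, ignore
        compared += 1
        if a == gap or b == gap:
            continue
        if a == b:
            identical += 1
            similar += 1
        elif same_type(a, b):
            similar += 1
    if compared == 0:
        raise ValueError("no aligned positions to compare")
    return 100.0 * identical / compared, 100.0 * similar / compared


if __name__ == "__main__":
    ident, simil = identity_and_similarity("MKVL-A", "MRILGA")
    print(f"identity {ident:.1f}%, similarity {simil:.1f}%")
```

A homologue could then be tested against the 60%, 70%, 80%, 90% or 95% similarity levels mentioned above by comparing the second returned value with the chosen threshold.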

In an alternative approach, the homologues or derivatives could be fusion proteins, 
incorporating moieties which render purification easier, for example by effectively 
tagging the desired protein or polypeptide. It may be necessary to remove the "tag" or
it may be the case that the fusion protein itself retains sufficient antigenicity to be 
useful. 



In an additional aspect of the invention there are provided antigenic/immunogenic 
fragments of the proteins or polypeptides of the invention, or of homologues or 
derivatives thereof. 

For fragments of the proteins or polypeptides described herein, or of homologues or
derivatives thereof, the situation is slightly different. It is well known that it is possible to
screen an antigenic protein or polypeptide to identify epitopic regions, ie those regions
which are responsible for the protein or polypeptide's antigenicity or immunogenicity.
Methods for carrying out such screening are well known in the art. Thus, the fragments
of the present invention should include one or more such epitopic regions or be
sufficiently similar to such regions to retain their antigenic/immunogenic properties.
Thus, for fragments according to the present invention the degree of identity is perhaps
irrelevant, since they may be 100% identical to a particular part of a protein or
polypeptide, homologue or derivative as described herein. The key issue, once again, is
that the fragment retains the antigenic/immunogenic properties.

Thus, what is important for homologues, derivatives and fragments is that they possess 
at least a degree of the antigenicity/immunogenicity of the protein or polypeptide from 
which they are derived. 

Gene cloning techniques may be used to provide a protein of the invention in
substantially pure form. These techniques are disclosed, for example, in J. Sambrook
et al, Molecular Cloning, 2nd Edition, Cold Spring Harbor Laboratory Press (1989).
Thus, in a third aspect, the present invention provides a nucleic acid molecule
comprising or consisting of a sequence which is:

(i) any of the DNA sequences set out in Table 1 or their RNA equivalents; 

(ii) a sequence which is complementary to any of the sequences of (i); 

(iii) a sequence which codes for the same protein or polypeptide, as those 
sequences of (i) or (ii); 

(iv) a sequence which has substantial identity with any of those of (i), (ii)
and (iii); or

(v) a sequence which codes for a homologue, derivative or fragment of a
protein as defined in Table 1.

In a fourth aspect the present invention provides a nucleic acid molecule comprising or 
consisting of a sequence which is: 

(i) any of the DNA sequences set out in Table 2 or their RNA equivalents; 

(ii) a sequence which is complementary to any of the sequences of (i); 

(iii) a sequence which codes for the same protein or polypeptide, as those 
sequences of (i) or (ii); 

(iv) a sequence which has substantial identity with any of those of (i), (ii) 
and (iii); or 
(v) a sequence which codes for a homologue, derivative or fragment of a
protein as defined in Table 2.

The nucleic acid molecules of the invention may include a plurality of such sequences,
and/or fragments. The skilled person will appreciate that the present invention can
include novel variants of those particular novel nucleic acid molecules which are
exemplified herein. Such variants are encompassed by the present invention. These
may occur in nature, for example because of strain variation. For example, additions,
substitutions and/or deletions are included. In addition, and particularly when utilising
microbial expression systems, one may wish to engineer the nucleic acid sequence by
making use of known preferred codon usage in the particular organism being used for
expression. Thus, synthetic or non-naturally occurring variants are also included within
the scope of the invention.

The term "RNA equivalent" when used above indicates that a given RNA molecule has
a sequence which is complementary to that of a given DNA molecule (allowing for the
fact that in RNA "U" replaces "T" in the genetic code).
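
As a purely illustrative sketch of the relationship just described, the following derives an RNA sequence complementary to a given DNA strand, with "U" in place of "T". The function names are hypothetical, and the convention of writing the result 5' to 3' (that is, as the reverse complement) is an assumption of the example rather than something stated in the text.

```python
# Base pairing from a DNA base to the RNA base that pairs with it.
_DNA_TO_RNA_PAIR = {"A": "U", "T": "A", "G": "C", "C": "G"}


def complementary_rna(dna):
    """RNA complementary to the DNA strand, written 5'->3' (assumed convention)."""
    try:
        return "".join(_DNA_TO_RNA_PAIR[base] for base in reversed(dna.upper()))
    except KeyError as exc:
        raise ValueError(f"unexpected base: {exc.args[0]!r}") from exc


def rna_same_sense(dna):
    """RNA with the same sense as the DNA strand: only T is replaced by U."""
    return dna.upper().replace("T", "U")


if __name__ == "__main__":
    print(complementary_rna("ATGC"))  # GCAU
    print(rna_same_sense("ATGC"))     # AUGC
```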

When comparing nucleic acid sequences for the purposes of determining the degree of
homology or identity one can use programs such as BESTFIT and GAP (both from the
Wisconsin Genetics Computer Group (GCG) software package). BESTFIT, for
example, compares two sequences and produces an optimal alignment of the most
similar segments. GAP enables sequences to be aligned along their whole length and
finds the optimal alignment by inserting spaces in either sequence as appropriate.
Suitably, in the context of the present invention when discussing identity of nucleic acid
sequences, the comparison is made by alignment of the sequences along their whole
length.
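
The whole-length comparison referred to above can be sketched with a simple global (Needleman-Wunsch style) alignment. The following is not the GCG GAP program: the scoring values (match 1, mismatch -1, gap -2), the linear gap penalty and the function names are assumptions chosen only to keep the example short.

```python
def global_align(a, b, match=1, mismatch=-1, gap=-2):
    """Align two sequences along their whole length; returns the gapped pair."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + sub,  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,      # space inserted in b
                score[i][j - 1] + gap,      # space inserted in a
            )
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a, b):
    """Identity over the full length of the global alignment."""
    aligned_a, aligned_b = global_align(a.upper(), b.upper())
    if not aligned_a:
        raise ValueError("both sequences are empty")
    same = sum(x == y for x, y in zip(aligned_a, aligned_b))
    return 100.0 * same / len(aligned_a)


if __name__ == "__main__":
    print(global_align("ACGT", "AGT"))
    print(percent_identity("ACGT", "AGT"))
```

Dividing by the full alignment length, gaps included, is one of several possible conventions; other denominators (for example the shorter sequence) give different figures.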
Alternatively PCR techniques may be used to amplify a desired nucleic acid sequence.
Thus the sequence data provided herein can be used to design two primers for use in 
PCR so that a desired sequence, including whole genes or fragments thereof, can be 
targeted and then amplified to a high degree. 

Typically primers will be at least 15-25 nucleotides long. 
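
A candidate primer can be checked against the length guidance above with a few lines of code. The sketch below is an assumption-laden illustration: the 15 nucleotide minimum is one reading of "at least 15-25", the function names are hypothetical, and the melting temperature uses the rough Wallace rule (2 °C per A/T, 4 °C per G/C), which is only an approximation for short oligonucleotides.

```python
def primer_stats(primer):
    """Length, GC content and a rough Wallace-rule Tm for a DNA primer."""
    seq = primer.upper()
    if not seq or set(seq) - set("ACGT"):
        raise ValueError("primer must be a non-empty string of A, C, G, T")
    gc = seq.count("G") + seq.count("C")
    at = len(seq) - gc
    return {
        "length": len(seq),
        "gc_percent": 100.0 * gc / len(seq),
        "wallace_tm_c": 2 * at + 4 * gc,  # rough estimate only
    }


def has_typical_length(primer, min_len=15):
    """True if the primer meets the assumed minimum length."""
    return len(primer) >= min_len


if __name__ == "__main__":
    print(primer_stats("ATGCATGCATGCATGC"))
    print(has_typical_length("ATGCATGCATGCATGC"))
```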

As a further alternative chemical synthesis may be used. This may be automated. 
Relatively short sequences may be chemically synthesised and ligated together to 
provide a longer sequence.

There is another group of proteins from S. pneumoniae which have been identified
using the bacterial expression system described herein. These are known proteins
from S. pneumoniae, which have not previously been identified as antigenic proteins.
The amino acid sequences of this group of proteins, together with DNA sequences
coding for them are shown in Table 3. These proteins, or homologues, derivatives
and/or fragments thereof also find use as antigens/immunogens. Thus, in another
aspect the present invention provides the use of a protein or polypeptide having a
sequence selected from those shown in Tables 1-3, or homologues, derivatives
and/or fragments thereof, as an immunogen/antigen.

In yet a further aspect the present invention provides an immunogenic/antigenic
composition comprising one or more proteins or polypeptides selected from those
whose sequences are shown in Tables 1-3, or homologues or derivatives thereof,
and/or fragments of any of these. In preferred embodiments, the
immunogenic/antigenic composition is a vaccine or is for use in a diagnostic assay.



In the case of vaccines suitable additional excipients, diluents, adjuvants or the like
may be included. Numerous examples of these are well known in the art.

It is also possible to utilise the nucleic acid sequences shown in Tables 1-3 in the 
preparation of so-called DNA vaccines. Thus, the invention also provides a vaccine 
composition comprising one or more nucleic acid sequences as defined herein. DNA 
vaccines are described in the art (see for instance, Donnelly et al, Ann. Rev.
Immunol., 15:617-648 (1997)) and the skilled person can use such art described
techniques to produce and use DNA vaccines according to the present invention. 

As already discussed herein the proteins or polypeptides described herein, their 
homologues or derivatives, and/or fragments of any of these, can be used in methods 
of detecting/diagnosing S. pneumoniae. Such methods can be based on the detection 
of antibodies against such proteins which may be present in a subject. Therefore the 
present invention provides a method for the detection/diagnosis of S. pneumoniae 
which comprises the step of bringing into contact a sample to be tested with at least 
one protein, or homologue, derivative or fragment thereof, as described herein. 
Suitably, the sample is a biological sample, such as a tissue sample or a sample of 
blood or saliva obtained from a subject to be tested. 

In an alternative approach, the proteins described herein, or homologues, derivatives 
and/or fragments thereof, can be used to raise antibodies, which in turn can be used 
to detect the antigens, and hence S. pneumoniae. Such antibodies form another aspect
of the invention. Antibodies within the scope of the present invention may be 
monoclonal or polyclonal. 



Polyclonal antibodies can be raised by stimulating their production in a suitable animal 
host (e.g. a mouse, rat, guinea pig, rabbit, sheep, goat or monkey) when a protein as 
described herein, or a homologue, derivative or fragment thereof, is injected into the 
animal. If desired, an adjuvant may be administered together with the protein.
Well-known adjuvants include Freund's adjuvant (complete and incomplete) and aluminium
hydroxide. The antibodies can then be purified by virtue of their binding to a protein as
described herein.

Monoclonal antibodies can be produced from hybridomas. These can be formed by
fusing myeloma cells and spleen cells which produce the desired antibody in order to
form an immortal cell line. Thus the well-known Kohler & Milstein technique (Nature
256 (1975)) or subsequent variations upon this technique can be used.

Techniques for producing monoclonal and polyclonal antibodies that bind to a
particular polypeptide/protein are now well developed in the art. They are discussed in
standard immunology textbooks, for example in Roitt et al, Immunology second edition
(1989), Churchill Livingstone, London.

In addition to whole antibodies, the present invention includes derivatives thereof which
are capable of binding to proteins etc as described herein. Thus the present invention
includes antibody fragments and synthetic constructs. Examples of antibody fragments
and synthetic constructs are given by Dougall et al in Tibtech 12 372-379 (September
1994).

Antibody fragments include, for example, Fab, F(ab')2 and Fv fragments (these are
discussed in Roitt et al, supra). Fv fragments can be modified
to produce a synthetic construct known as a single chain Fv (scFv) molecule. This
includes a peptide linker covalently joining Vh and Vl regions, which contributes to the
stability of the molecule. Other synthetic constructs that can be used include CDR
peptides. These are synthetic peptides comprising antigen-binding determinants.
Peptide mimetics may also be used. These molecules are usually conformationally
provides a method for the detection/diagnosis of S. pneumoniae which comprises the
step of bringing into contact a sample to be tested with at least one nucleic acid
sequence as described herein. Suitably, the sample is a biological sample, such as a
tissue sample or a sample of blood or saliva obtained from a subject to be tested.
Such samples may be pre-treated before being used in the methods of the invention.
Thus, for example, a sample may be treated to extract DNA. Then, DNA probes
based on the nucleic acid sequences described herein (ie usually fragments of such
sequences) may be used to detect nucleic acid from S. pneumoniae.

In additional aspects, the present invention provides:

(a) a method of vaccinating a subject against S. pneumoniae which comprises the
step of administering to a subject a protein or polypeptide of the invention, or a
derivative, homologue or fragment thereof, or an immunogenic composition of the
invention;

(b) a method of vaccinating a subject against S. pneumoniae which comprises the
step of administering to a subject a nucleic acid molecule as defined herein;

(c) a method for the prophylaxis or treatment of S. pneumoniae infection which
comprises the step of administering to a subject a protein or polypeptide of the
invention, or a derivative, homologue or fragment thereof, or an immunogenic
composition of the invention;
(d) a method for the prophylaxis or treatment of S. pneumoniae infection which
comprises the step of administering to a subject a nucleic acid molecule as defined
herein;

(e) a kit for use in detecting/diagnosing S. pneumoniae infection comprising one
or more proteins or polypeptides of the invention, or homologues, derivatives or
fragments thereof, or an antigenic composition of the invention; and

(f) a kit for use in detecting/diagnosing S. pneumoniae infection comprising one
or more nucleic acid molecules as defined herein.

Given that we have identified a group of important proteins, such proteins are
potential targets for anti-microbial therapy. It is necessary, however, to determine
whether each individual protein is essential for the organism's viability. Thus, the
present invention also provides a method of determining whether a protein or
polypeptide as described herein represents a potential anti-microbial target which
comprises antagonising, inhibiting or otherwise interfering with the function or
expression of said protein and determining whether S. pneumoniae is still viable.

A suitable method for inactivating the protein is to effect selected gene knockouts, ie
prevent expression of the protein and determine whether this results in a lethal
change. Suitable methods for carrying out such gene knockouts are described in Li
et al, P.N.A.S., 94:13251-13256 (1997) and Kolkman et al, 178:3736-3741 (1996).

In a final aspect the present invention provides the use of an agent capable of
antagonising, inhibiting or otherwise interfering with the function or expression of a
protein or polypeptide of the invention in the manufacture of a medicament for use in
the treatment or prophylaxis of S. pneumoniae infection.

As mentioned above, we have used a bacterial expression system as a means of
identifying those proteins which are surface associated, secreted or exported and
thus would find use as antigens.



The information necessary for the secretion/export of proteins has been extensively
studied in bacteria. In the majority of cases, protein export requires a signal peptide 
to be present at the N-terminus of the precursor protein so that it becomes directed to 
the translocation machinery on the cytoplasmic membrane. During or after 
translocation, the signal peptide is removed by a membrane associated signal 
peptidase. Ultimately the localization of the protein (i.e. whether it be secreted, an 
integral membrane protein or attached to the cell wall) is determined by sequences 
other than the leader peptide itself. 

We are specifically interested in surface located or exported proteins as these are 
likely to be antigens for use in vaccines, as diagnostic reagents or as targets for 
therapy with novel chemical entities. We have therefore developed a screening 
vector-system in Lactococcus lactis that permits genes encoding exported proteins to 
be identified and isolated. We provide below a representative example showing how 
given novel surface associated proteins from Streptococcus pneumoniae have been 
identified and characterized. The screening vector incorporates the staphylococcal 
nuclease gene nuc lacking its own export signal as a secretion reporter. 
Staphylococcal nuclease is a naturally secreted heat-stable, monomeric enzyme 
which has been efficiently expressed and secreted in a range of Gram positive 
bacteria (Shortle, Gene, 22:181-189 (1983); Kovacevic et al., J. Bacteriol.,
162:521-528 (1985); Miller et al., J. Bacteriol., 169:3508-3514 (1987); Liebl et al.,
J. Bacteriol., 174:1854-1861 (1992); Le Loir et al., J. Bacteriol., 176:5135-5139
(1994); Poquet et al., J. Bacteriol., 180:1904-1912 (1998)).

Recently, Poquet et al. ((1998), supra) have described a screening vector 
incorporating the nuc gene lacking its own signal leader as a reporter to identify 
exported proteins in Gram positive bacteria, and have applied it to L. lactis. This 
different frames was included to prevent translational fusions between vector genes
and cloned DNA fragments. The pTREX7 vector was first digested with EcoRI and
blunted using the 5' - 3' polymerase activity of T4 DNA polymerase (NEB)
according to the manufacturer's instructions. The EcoRI digested and blunt ended
pTREX7 vector was then digested with BglII, thus removing the P7 promoter. The
artificial DNA fragment derived from the annealed synthetic oligonucleotides was
then digested with EcoRV and BamHI and cloned into the EcoRI(blunted)-BglII
digested pTREX7 vector to generate pTREP. A Lactococcus lactis MG1363
chromosomal promoter designated P1 was then cloned between the EcoRI and BglII
sites present in the pTREP expression cassette forming pTREP1. This promoter was
also isolated using the promoter probe vector pSB292 and characterised by
Waterfield et al., (1995), supra. The P1 promoter fragment was originally
amplified by PCR using Vent DNA polymerase according to the manufacturer's
instructions and cloned into the pTREX as an EcoRI-BglII DNA fragment. The
EcoRI-BglII P1 promoter containing fragment was removed from pTREX1 by
restriction enzyme digestion and used for cloning into pTREP (Schofield et al., pers.
comm., University of Cambridge, Dept. Pathology).

(b) PCR amplification of the S. aureus nuc gene.

The nucleotide sequence of the S. aureus nuc gene (EMBL database accession
number V01281) was used to design synthetic oligonucleotide primers for PCR
amplification. The primers were designed to amplify the mature form of the nuc
gene designated nucA which is generated by proteolytic cleavage of the N-terminal
19 to 21 amino acids of the secreted propeptide designated Snase B (Shortle, (1983),
supra). Three sense primers (nucS1, nucS2 and nucS3, Appendix 1) were designed,
each one having a blunt-ended restriction endonuclease cleavage site for EcoRV or
SmaI in a different reading frame with respect to the nuc gene. Additionally BglII
and BamHI were incorporated at the 5' ends of the sense and anti-sense primers
respectively to facilitate cloning into BamHI and BglII cut pTREP1. The sequences
of all the primers are given in Appendix 1. Three nuc gene DNA fragments
encoding the mature form of the nuclease gene (NucA) were amplified by PCR using
each of the sense primers combined with the anti-sense primer described above. The
nuc gene fragments were amplified by PCR using S. aureus genomic DNA template,
Vent DNA Polymerase (NEB) and the conditions recommended by the
manufacturer. An initial denaturation step at 93 °C for 2 min was followed by 30
cycles of denaturation at 93 °C for 45 sec, annealing at 50 °C for 45 sec, and
extension at 73 °C for 1 min and then a final 5 min extension step at 73 °C. The
PCR amplified products were purified using a Wizard clean up column (Promega) to
remove unincorporated nucleotides and primers.
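
For convenience, the cycling conditions given above can be written down as a small data structure, for example when transcribing them into a thermal cycler program. The sketch below simply restates the temperatures and times from the text; the class and function names are hypothetical, and the computed total covers hold times only, since ramp times between temperatures depend on the instrument and are not given.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    name: str
    temp_c: int
    seconds: int


# Values taken from the text above.
INITIAL = Step("initial denaturation", 93, 120)
CYCLE = (
    Step("denaturation", 93, 45),
    Step("annealing", 50, 45),
    Step("extension", 73, 60),
)
CYCLES = 30
FINAL = Step("final extension", 73, 300)


def total_hold_seconds():
    """Sum of all programmed hold times, excluding ramping."""
    per_cycle = sum(step.seconds for step in CYCLE)
    return INITIAL.seconds + CYCLES * per_cycle + FINAL.seconds


if __name__ == "__main__":
    total = total_hold_seconds()
    print(f"{total} s ({total / 60:.0f} min) of programmed hold time")
```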

(c) Construction of the pTREP1-nuc vectors

The purified nuc gene fragments described in section b were digested with BglII and
BamHI using standard conditions and ligated to BamHI and BglII cut and
dephosphorylated pTREP1 to generate the pTREP1-nuc1, pTREP1-nuc2 and
pTREP1-nuc3 series of reporter vectors. General molecular biology techniques were
carried out using the reagents and buffer supplied by the manufacturer or using
standard conditions (Sambrook and Maniatis, (1989), supra). In each of the
pTREP1-nuc vectors the expression cassette comprises a transcription terminator, lactococcal
promoter P1, unique cloning sites (BglII, EcoRV or SmaI) followed by the mature
form of the nuc gene and a second transcription terminator. Note that the sequences
required for translation and secretion of the nuc gene were deliberately excluded in
this construction. Such elements can only be provided by appropriately digested
foreign DNA fragments (representing the target bacterium) which can be cloned into
the unique restriction sites present immediately upstream of the nuc gene.




In possessing a promoter, the pTREP1-nuc vectors differ from the pFUN vector
described by Poquet et al. (1998), supra, which was used to identify L. lactis
exported proteins by screening directly for Nuc activity in L. lactis. As the
pFUN vector does not contain a promoter upstream of the nuc open reading frame
the cloned genomic DNA fragment must also provide the signals for transcription in
addition to those elements required for translation initiation and secretion of Nuc.
This limitation may prevent the isolation of genes that are distant from a promoter,
for example genes which are within polycistronic operons. Additionally there can be
no guarantee that promoters derived from other species of bacteria will be
recognised and functional in L. lactis. Certain promoters may be under stringent
regulation in the natural host but not in L. lactis. In contrast, the presence of the P1
promoter in the pTREP1-nuc series of vectors ensures that promoterless DNA
fragments (or DNA fragments containing promoter sequences not active in L. lactis)
will still be transcribed.

(d) Screening for secreted proteins in S. pneumoniae 

Genomic DNA isolated from S. pneumoniae was digested with the restriction
enzyme Tru9I. This enzyme, which recognises the sequence 5'-TTAA-3', was used
because it cuts A/T rich genomes efficiently and can generate random genomic
DNA fragments within the preferred size range (usually averaging 0.5 - 1.0 kb).
This size range was preferred because there is an increased probability that the P1
promoter can be utilised to transcribe a novel gene sequence. However, the P1
promoter may not be necessary in all cases as it is possible that many Streptococcal
promoters are recognised in L. lactis. DNA fragments of different size ranges were
purified from partial Tru9I digests of S. pneumoniae genomic DNA. As the Tru9I
restriction enzyme generates staggered ends the DNA fragments had to be made
blunt ended before ligation to the EcoRV or SmaI cut pTREP1-nuc vectors. This
was achieved by the partial fill-in enzyme reaction using the 5'-3' polymerase
activity of Klenow enzyme. Briefly, Tru9I digested DNA was dissolved in a solution
(usually between 10-20 µl in total) supplemented with T4 DNA ligase buffer (New
England Biolabs; NEB) (1X) and 33 µM of each of the required dNTPs, in this case
dATP and dTTP. Klenow enzyme was added (1 unit Klenow enzyme (NEB) per µg
of DNA) and the reaction incubated at 25°C for 15 minutes. The reaction was
stopped by incubating the mix at 75°C for 20 minutes. EcoRV or SmaI digested
pTREP-nuc plasmid DNA was then added (usually between 200-400 ng). The mix
was then supplemented with 400 units of T4 DNA ligase (NEB) and T4 DNA ligase
buffer (1X) and incubated overnight at 16°C. The ligation mix was precipitated
directly in 100% ethanol and 1/10 volume of 3M sodium acetate (pH 5.2) and used
to transform L. lactis MG1363 (Gasson, 1983). Alternatively, the gene cloning site
of the pTREP-nuc vectors also contains a BglII site which can be used to clone for
example Sau3AI digested genomic DNA fragments.
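
The fragment sizes to be expected from a Tru9I digest can be estimated in silico before carrying out the experiment. The following sketch locates 5'-TTAA-3' sites in one strand and cuts after the first T (T^TAA). It models a complete digest, whereas partial digests were used above, and the function names and the example sequence are hypothetical.

```python
import re

SITE = "TTAA"    # Tru9I recognition sequence
CUT_OFFSET = 1   # cut between the first and second base of the site (T^TAA)


def cut_positions(seq):
    """0-based positions at which the top strand is cut."""
    return [m.start() + CUT_OFFSET for m in re.finditer(SITE, seq.upper())]


def digest(seq):
    """Fragments of the top strand after a complete digest."""
    seq = seq.upper()
    bounds = [0] + cut_positions(seq) + [len(seq)]
    return [seq[start:end] for start, end in zip(bounds, bounds[1:])]


if __name__ == "__main__":
    fragments = digest("GGTTAACCTTAAGG")
    print(fragments)
    print([len(f) for f in fragments])
```

In a partial digest only some of these sites are cut, so the observed fragments would be unions of adjacent fragments from the list returned by digest.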

L. lactis transformant colonies were grown on brain heart infusion agar and nuclease
secreting (Nuc+) clones were detected by a toluidine blue-DNA-agar overlay (0.05
M Tris pH 9.0, 10 g of agar per litre, 10 g of NaCl per litre, 0.1 mM CaCl2, 0.03%
wt/vol. salmon sperm DNA and 90 mg of Toluidine blue O dye) essentially as
described by Shortle, 1983, supra and Le Loir et al., 1994, supra. The plates were
then incubated at 37°C for up to 2 hours. Nuclease secreting clones develop an
easily identifiable pink halo. Plasmid DNA was isolated from Nuc+ recombinant L.
lactis clones and DNA inserts were sequenced on one strand using the NucSeq
sequencing primer described in Appendix 1, which sequences directly through the
DNA insert.



Isolation of Genes Encoding Exported Proteins from S. pneumoniae

A large number of gene sequences putatively encoding exported proteins in S.
pneumoniae have been identified using the nuclease screening system. These have
modified insert of interest and which allows in vitro transcription of a cloned gene in
the sense orientation.

Preparation of DNA vaccines

... encode only the mature portion of the gene protein. ...
those sequences that encode only the mature portion of a target gene protein ...
... does not make up any part of the mature protein ... N-terminal ...
target the whole of the gene sequence for cloning and, ultimately, expression in
pcDNA3.1.

in the genes encoding the proteins of interest were excluded during the 
design of oligonucleotides. These features included: 

1. LPXTG cell wall anchoring motifs.

2. LXXC lipoprotein attachment sites.

3. Hydrophobic C-terminal domain.

4. Where no N-terminal signal peptide or LXXC was present the start codon was
excluded.

5. Where no hydrophobic C-terminal domain or LPXTG motif was present the stop
codon was removed.
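
The first two sequence features listed above lend themselves to a simple pattern search over a translated protein sequence. The sketch below is illustrative only: the function name and the example sequence are hypothetical, "X" is treated as any single residue, and a bare pattern match is an assumption that will over-report, since a genuine lipoprotein attachment site or cell wall anchor also depends on its position and sequence context.

```python
import re

# Lookaheads are used so that overlapping occurrences are all reported.
LPXTG = re.compile(r"(?=LP.TG)")
LXXC = re.compile(r"(?=L..C)")


def find_motifs(protein):
    """0-based start positions of LPXTG and LXXC patterns in a protein."""
    seq = protein.upper()
    return {
        "LPXTG": [m.start() for m in LPXTG.finditer(seq)],
        "LXXC": [m.start() for m in LXXC.finditer(seq)],
    }


if __name__ == "__main__":
    print(find_motifs("MKKLAACSTLPETGEE"))
```

Positions reported in this way could then be used to decide which coding region to exclude when the primers are designed.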

Appropriate PCR primers were designed for each gene of interest, and any and all of the regions encoding the above features were removed from the gene when designing these primers. The primers were designed with the appropriate enzyme restriction site followed by a conserved Kozak nucleotide sequence (in most cases GCCACC was used; NB except in occasional instances, for example ID59). The Kozak sequence facilitates the recognition of initiator sequences by eukaryotic ribosomes. An ATG start codon was placed upstream of the insert of the gene of interest. For example, for a forward primer using a BamHI site, the primer would begin GCGGGATCCGCCACCATG, followed by a small section of the 5' end of the gene of interest. The reverse primer was designed to be compatible with the forward primer and with a NotI restriction site at the 5' end in most cases (this site is TTGCGGCCGC) (NB except in occasional instances, for example ID59, where a XhoI site was used instead of NotI).
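
The primer layout described above (restriction site, Kozak sequence, ATG, then gene-specific bases) can be illustrated with a short sketch. The function names and parameters below are hypothetical. The default prefixes are taken from the text (a BamHI-containing prefix for the forward primer, TTGCGGCCGC for the reverse primer), while the gene fragments in the usage lines are simply read back from the ID29 primers listed in this section.

```python
# Translation table for complementing unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an A/C/G/T sequence."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def forward_primer(gene_5prime: str, site_prefix: str = "GCGGGATCC",
                   kozak: str = "GCCACC") -> str:
    """Restriction-site prefix + Kozak sequence + ATG + 5' gene section."""
    return site_prefix + kozak + "ATG" + gene_5prime.upper()


def reverse_primer(gene_3prime: str, site_prefix: str = "TTGCGGCCGC") -> str:
    """Restriction-site prefix + reverse complement of the 3' gene section."""
    return site_prefix + reverse_complement(gene_3prime)


# The listed ID29 primers start with the shorter prefix CGGATCC.
id29_fwd = forward_primer("CAAAAAGAGCGGTATGGTTATG", site_prefix="CGGATCC")
# Top-strand fragment inferred by reverse-complementing the listed ID29 reverse primer.
id29_rev = reverse_primer("AAGGGATTAAGAATGGGGGT")
print(id29_fwd)
print(id29_rev)
```

The sketch covers only the common BamHI/NotI case; the exceptions noted in the text (for example ID59) would need different prefixes passed in.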

PCR primers

The following PCR primers were designed and used [...]:

ID5

Forward Primer 5' [...]

ID59

[...]

ID51

[...]

ID29

Forward Primer 5' CGGATCCGCCACCATGCAAAAAGAGCGGTATGGTTATG 3'

Reverse Primer 5' TTGCGGCCGCACCCCCATTCTTAATCCCTT 3'

ID50

Forward Primer 5' CGGATCCGCCACCATGGAGGTATGTGAAATGTCACGTAAA 3'

Reverse Primer 5' TTGCGGCCGCTTTTACAAAGTCAAGCAAAGCC 3'

Cloning 

The insert, along with the flanking features described above, was amplified using PCR against a template of genomic DNA isolated from type 4 S. pneumoniae strain 11886 obtained from the National Collection of Type Cultures. The PCR product was cut with the appropriate restriction enzymes and cloned into the multiple cloning site of pcDNA3.1 using conventional molecular biological techniques. Suitably mapped clones of the genes of interest were cultured and the plasmids isolated on a large scale (> 1.5 mg) using Plasmid Mega Kits (Qiagen). Successful cloning and maintenance of genes was confirmed by restriction mapping and by sequencing approximately 700 base pairs through the 5' cloning junction of each large scale preparation of each construct.

Strain validation 

A strain of type 4 was used in the cloning and challenge methods; this is the strain from which the S. pneumoniae genome was sequenced. A freeze-dried ampoule of a homogeneous laboratory strain of type 4 S. pneumoniae, strain NCTC 11886, was obtained from the National Collection of Type Cultures. The ampoule was opened and the culture resuspended with 0.5 ml of tryptic soy broth (0.5% glucose, 5% blood). The suspension was subcultured into 10 ml tryptic soy broth (0.5% glucose, 5% blood) and incubated statically overnight at 37°C. This culture was streaked on to 5% blood agar plates to check for contaminants and confirm viability, and on to blood agar slopes; the rest of the culture was used to make 20% glycerol stocks. The slopes were sent to the Public Health Laboratory Service, where the type 4 serotype was confirmed.

A glycerol stock of NCTC 11886 was streaked on a 5% blood agar plate and incubated overnight in a CO2 gas jar at 37°C. Fresh streaks were made and optochin sensitivity was confirmed.
Pneumococcal challenge



1. Streak pneumococcal culture and confirm identity.

2. Grow overnight culture from 4-5 colonies on the plate above.

3. Animal passage pneumococcal culture (i.p. injection of cardiac bleed to harvest).

4. Grow overnight culture from animal-passaged pneumococcus.

5. Grow day culture (to pre-determined optical density) from overnight of animal passage and freeze down at -70°C. This is the standard inoculum.

6. Thaw one aliquot of standard inoculum to viable count.

7. Use standard inoculum to determine effective dose (called Virulence Testing).
p value 2 refers to significance tests compared to pcDNA3.1+ vaccinated controls

Statistical Analyses

Trial 1 - None of the other groups had significantly longer survival times than the controls. The survival times of the unvaccinated and pcDNA3.1 control groups were not significantly different. One of the mice from ID5 [...] and the mean survival times for ID5 were extended, [...] not significantly.

[...] ID59 had significantly [...].

Trial 5 - The group vaccinated with ID59 again survived for an average of almost [...] hours longer than the controls, but the results were not quite statistically significant.

Trial [...] - The [...] have [...] significantly higher than unvaccinated controls (p = <36.0). However, there were 2 outlying mice in the vaccinated group.

Vaccine trials 7 and 8 (See figure 2) 
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Mouse 
number      Mean survival times (hours)
            Unvacc         ID29 (7)     Unvacc         ID50 (8)
            control (7)                 control (8)
1           59.6           73.1         45.1           60.6
2           47.2           54.8         50.8           60.6
3           59.6           59.3         60.4           51.1
4           70.9           54.8*        55.2           60.6
5           68.6*          59.3         45.1           60.6
6           76.0           54.8         45.1           60.6
Mean        63.6           59.35        50.2           59.1
sd          10.3           7.1          6.4            3.9
p value 1                  <39.0                       0.0048



* - [...] have received full inoculum

T - terminated at end of experiment having no symptoms of [...]
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Numbers in brackets - survival times disregarded assuming incomplete dosing 
p value 1 refers to significance tests compared to unvaccinated controls 

Statistical Analyses. 

Trial 7 - The ID29 vaccinated group showed prolonged times to the first death.
Trial 8 - The group vaccinated with ID50 survived significantly longer than 
unvaccinated controls. 
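
The Mean and sd rows of the trial 7 and 8 table can be re-derived from the individual survival times. The sketch below assumes the sd row is the sample standard deviation (n - 1 denominator); the source does not say which estimator or which significance test was used, so no p value is recomputed here. The dictionary name and the helper `summarise` are illustrative only.

```python
from statistics import mean, stdev

# Individual survival times (hours) for mice 1-6, copied from the table.
survival_hours = {
    "Unvacc control (7)": [59.6, 47.2, 59.6, 70.9, 68.6, 76.0],
    "ID29 (7)": [73.1, 54.8, 59.3, 54.8, 59.3, 54.8],
    "Unvacc control (8)": [45.1, 50.8, 60.4, 55.2, 45.1, 45.1],
    "ID50 (8)": [60.6, 60.6, 51.1, 60.6, 60.6, 60.6],
}


def summarise(values):
    """Return (mean, sample standard deviation), rounded for display."""
    return round(mean(values), 2), round(stdev(values), 1)


for group, hours in survival_hours.items():
    group_mean, group_sd = summarise(hours)
    print(f"{group}: mean {group_mean} h, sd {group_sd} h")
```

Under that assumption the recomputed standard deviations match the sd row, and the recomputed means agree with the Mean row to within about 0.1 hours.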
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Appendix I - Oligonucleotide primers 
nucS1

Bgl II Eco RV
5'- cgagatctgatatctcacaaacagataacggcgtaaatag -3'

nucS2

Bgl II Sma I
5'- gaagatcttccccgggatcacaaacagataacggcgtaaatag -3'

nucS3

Bgl II Eco RV
5'- cgagatctgatatccatcacaaacagataacggcgtaaatag -3'

nucR

Bam HI
5'- cgggatccttatggacctgaatcagcgttgtc -3'

NucSeq

5'- ggatgctttgtttcaggtgtatc -3'

pTREPF

5'- catgatatcggtacctcaagctcatatcattgtccggcaatggtgtgggctraragttttagcggataacaatttcacac -3'

pTREPR

5'- gcggatcccccgggcttaattaatgtttaaacactagtcgaagatctcgcgaattctcctgtgtgaaattgttatccgcta -3'

pUCF

5'- cgccagggttttcccagtcacgac -3'

VR

5'- tcaggggggcggagcctatg -3'

V1

5'- tcgtatgttgtgtggaattgtg -3'

V2

5'- tccggctcgtatgttgtgtggaattg -3'
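
The restriction sites annotated above several of the Appendix I primers can be located in the primer sequences themselves. This is a minimal sketch using the standard recognition sequences of the named enzymes; the dictionary and function names are illustrative, and only primers whose printed sequence is fully legible are included.

```python
# Standard recognition sequences for the enzymes named in the annotations.
SITES = {
    "Bgl II": "AGATCT",
    "Eco RV": "GATATC",
    "Sma I": "CCCGGG",
    "Bam HI": "GGATCC",
}

# Primer sequence and the enzymes annotated above it in Appendix I.
PRIMERS = {
    "nucS2": ("gaagatcttccccgggatcacaaacagataacggcgtaaatag", ["Bgl II", "Sma I"]),
    "nucS3": ("cgagatctgatatccatcacaaacagataacggcgtaaatag", ["Bgl II", "Eco RV"]),
    "nucR": ("cgggatccttatggacctgaatcagcgttgtc", ["Bam HI"]),
}


def site_positions(primer: str, enzymes: list) -> dict:
    """Map each enzyme to the 0-based index of its first site (-1 if absent)."""
    seq = primer.upper()
    return {enzyme: seq.find(SITES[enzyme]) for enzyme in enzymes}


for name, (sequence, enzymes) in PRIMERS.items():
    print(name, site_positions(sequence, enzymes))
```

A result of -1 for an annotated enzyme would point to a transcription error in either the annotation or the sequence.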
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TABLE 1 



ID4 1200 bp



ATGAGAAATATGTGGCTTGTAATCAAGGAAACCTATCTTCGACATGTCGAGTCATGGA Ol 1 IC ' I ' I L ' l ' n 'ATGGTGA 
TTTCGCCGTTCCTci i i i i AGGAATCTCTGTAGGAATTGGGCATCTCCAAGGTTCTTCTATGGCTAAAAATAATAA 
AGTGGCAGTAGTGACAACAGTGCCATCTGTAGCAGAAGGACTGAAGAATGTAAATGGTCTTTAACTTCGACTATAA 

in a ^^F^^ a ^^^^^^ ^^ a ^ a ^^ a ^^aa^aa^Q^gaaaaattaaaaggttatttgaccattgatcaagaagata 
iu gtgttctaaaggcagtttatcatggcgaaacatcgcttgaaaatggaattaaatttgaggttacaggtacactca 
atgaactgcaaaatcagcttaatcgttcaactgcttccttgtctcaagagcaggaaaaacg 
ttca attcacag aaaagattgatgaagccaaggaaaataaaaagtttattcaaacaattgcagcaggtgccttag 
gattci iiLii i atatc^ctgattacctatgcgggtgtaacagctcaggaagttgccagtgaaaaaggcaccaa 

A ATTA TGGAAGTCG i i i I l iCTAGCATAAGGGCAAGTCACTATTTCTATGCGCGGATGATGGCTCTGTTTCTAnTn 
1 ^ ATrTTAACGCA TATT GGGATCTATGTTGTAGGTGGTCTGGCTGCCCrrrrrGCTCm 
TCAGTCTGGTATTTTGGATCACTTGGGAGATGCTAT^ 

TGTAC GTAG TCTTGGCAGC CTTCCTAGG ATCTATGGTTTCTCOTCCTGAGGACTCAG 

GATGATTTTGATTATGGGTGGT7TTTTTGGAGTGACAGCTCTAGGTGCAGCTGGTGACAATCTCCT 

GGTTCTTATATTCCCTTTATTTCGACCTTCTTTATGCCGTTTCGAACGATTA^ 

CATGGATTTCACTTGCTATTACAGTGATTTTTGCGGTGGTAGCAACAGGATTTATCGGACGCATGTATGCT 
CGTTCTTCAAACGGATGATTTAGGGATTTGGAAAACCTTTAAACGTGCCTTATCTTATAAATAG 

MRNMWVVIKETYLRHVESWSFFFMVISPFLFLGISVGIGHLQGSSMAKNNKVAVVTTVPS^ 
EASAKEAIKEEKLKGYLTIDQEDSVIJCAVYHGETSL£NGIKFEVTGTLNELQNQ 
23 DEAKEhnCKFIQTlAAGALGFFLYMILrrYAG>rrA0EVASEKGTKIMEVVFSSIRA5HYFYARMMALFL^ 
GLAAVLi-FKDLPFLAQSGILDHLGDAISLNTLXHLISLFMYVVLAAFLGSMVSRPEDSGKAl^PLMn^ 
^GDNIXI-KIGSYIPFISTFFMPFRTINDYAGGAEAWISLAITVIFAVVATGFIGRMYASLVLQTDDLGI 

ID5 1125 bp

CCTGGGAAAGTCTTGAAAATTATGATAGAATGGTGGAAGGAAAAATTCAGGAGAGTAGTAGTGACTCAAAATGTT 
G A A AG TCTTCT CGT ATCC ATTGT A ATC AGTG CAT A C AATG A AG A AAA AT ATCTGCCTG GTCT A ATTG A AG ACTT AA 

AAAATCAAACCTATCCTAAAGAGGATATTGAAATTCTATTTATAAATGCTATGTCCACAGATGGGACCACAGCTA 

TCATTCAGCAATTTATAAAGGAAGATACAGAGTTTAACTCAATT 

CTAGTGGTTp-AACCTGGGAGTTAAACATTCTGTAGGGGACCTTAT^ 

TGAGACTTTTGTAATGAACAATGTGGCTATTATTCAACAAGGTGAATTTGTCTGTGGGGGGCCTAGACCGACGATT 
GTCGAAGGAAAAGGAAAATGGGCAGAGACCTTGCATCTTGTTGAGGAAAATATGTTTGGCAGTAGCATTGCCAAT 
TATCGAAATAGTTCTGAGGATAGATATGTTTCTTCTATTrTTCATGGAATGTATAAACGAGA 
W TTGGTTn"AGTAAATGAGCAACTTGGCCGAACTGAAGATAATGATATTCATTATAGAATTCGAGAATATGGTTATAA 
AATCCGCTATAGCCCAAGTATTCTATCTTATCAGTATATTCGACCAACATTCAAGAAAATGCTGCATCAAAAGTAT 
TCAAATCGTTTGTGGATTGGCTTGACAAGTCATGTTCAGm 

>«< A iizr c i±sr r ^ GTCA ^ ACTGA 

4D A IP TA *rrrrCCATTCACTrrGCTTATGGCCTrGGGACGATTGTAGGm 

AGTACAAGAGAACAATAATTTATTTGGATAAAATAAGCCAAATAAATCAAAATATGCTATAA 

rcKVLKIMIEWWKEKFRRVVVTQNVESIXVSIVISAYNEEKYLPGLreDLKNQTYPKEDIEILRNAMSTDCOT 
EDTEFNSIRLYNNPKKNQASGFNLGVKHSVGDLILKIDAHSKVTETFVMNNVAIICXKJEFVCGGPRJ^ 
50 LHLVEENMFGSSIANYRNSSEDRYVSSIFHGMYKREVFQKVGLVNEQLGRTEDN^ 
TFKKMLHQKYSNGLWIGLTS^^ 

LIVMPF1LFSIHFAYGLGTIVGLIRGFKWKKEYKRTIIYLDKISQINQNML2 H-KtlKNtjr 



ID11 696 bp 



ATCATGAAAGAACAAAATACCATAGAAATCGAT^^ 
ATmAATAGTGGCACTTCrrGACAGGTGCGGGGGCT^ 

^ A ^ CGCGAA ™ ACCTAGTGAATC ^^ 

so GAA ^ A ^ GGTA ^ AAGACTACC ^^ 

60 ACTAGATTTGACGCCAAAAGGTrrGGCTAATAAA^ 

TCAGTTAATGATCGAGTTCCTGAACAGGCAACCCGTATCGCTAA 
A^CTATTACTCGTGTTTCTGAOT^^ 

ATTAAACGCAATACACTAATTGG1 1 1 I I I GGCAGGGGTGaTTGG AACTAGTGTTATAGTTCTTCATCTTGAACTTTT 






GGATACTCCTGTGAAACCn'CCGGAAGATATCGAAAATACArrGCAGATCACACTTTTGGGACTTGTGCCAAACTT 
GGGTAAGTTGAAATAG 

MMKEQhmEIDVFQLVKSLWKRKLMlLIVAL^ 
D VKI5YREIIUQDVI^EVVSDLKI^LTPKGUVN^ 

TLEEARPAlSPSSPNlKR>rrLIGFLAGVIGTSVIVLHLELLDTRVKiU'EDIEOTLOMTLLGVVPNLGKl^ 

ID19 555 bp

10 ATGGTAAAAGTAGCAGTTATATTAGCTC AGGG CTTTGAAGAAATrGAAGCCTTGACACnTGTAGATCT 
GAGCCAATATCACATGTGATATGGTTGGTTTTGAAGAGCAAGTAACGGGTTCGCATGCAATC 
ATCATGTCTTTGATGGAGATTrATCAGACTATGATATGATTGTTCTTCCTGGAGGTATGCCTGGTTCTGC 
CGTGATAATCAGACCTTGATTCAAGAATTGCAAAGCTTCGAGCAAGAAGGGAAGAj^ACTAGCAGCCATTTGTGCG 
GCACCAATTGCCCTCAATCAAGCAGAGATATTGAAAAATAAGCGATACACTTGTTATGACGGCGTrCAAGAGCAA 
ATCCTTGA TGGT CACTACGTCAAGGAAACAGTAGTGGTAGATGGTCAGTTGACAACCAGTCGGGGTCCTTrAArA 
GCCCTTGC CTTTG CCTACGAGTTGGTGGAGCAACTAGGAGGGGACGCAGAGAGTTTACGAACAGGAATnf^rTAT 
CGAGATGTCTTTGGTAAAAATCAGTAA 



MVKVAVILAQGFEEIEALTWDVLRRANrrCDMVGFEEQVTGSHAIQVRADHVFDGDLSDYDMIVLPG^ 
DNQTLIQELQSFEOEGKXL^AICAAPIALNOAEiUCNKRYTCYDGVQEQILDGHYVKETVVVIXSOLTTSR 
YELVEQLGGDAE5LRTGMLYRDVFGKNQ2 

ID27 306 bp 

25 GTG GT AGG G ATG GT AG AA CC AA ACCT A G AA A G CCTT AT A A A A G ATCTTT AC AATC ATGCTCG AC ATG ATTTG AGT 

GAAGATTTAGTTGCTGCTCTCCTAGAGACTACTAAAAAACTGCCrACTA 
TCTCAGGCCTGGTCAAXCGTGAATTGCTCCTAAATCCCAAACATCCAG 

TGTCAj\AAGaGAAGAAGCCAAGTACAGAGGAACTGCGACTTCTGCGCTTATGTATGAGGAACTCTTTAAAATGCT 
TTGA 

30 

^eakyIgta^s^ 

ID29 945 bp

ttgttcttaaaaaaggaaagagaggta^cagcatgcgtaaatggacaaa^ 

ACTACCGTTATCGGCTTTATCCTGCTTTTTGTAGGTATCCAATCT 

AGAACCTGTCrATGATAGCCGTACGGAAAAGCTAACCTTTGGCAAGGAAGTCGAAAACCTAGAAATTACTCTCCA 

ccaacacacgctcaccatcacagactctttcgatgatcaaatccacatttcttaccatccatctctttctgctcac 
catgatcttatcaccaatcagaacgatagaactctgagtctcactgataagaaactgtctgaaactccgtttctct 
cttctggaattggtgggattcttcatatcccaactagctact^ 

aaaagggagaactctaaaagggatcaacatctcagccaatcgcggacaaaccaccatcataaatgctagccttga 
aa ^T gcgaccctcaatacaaaCagct ^ 

gcccaatatcgttaatatctttgatacagttcitacagatagtcagctagagtcaacagagaatcacntccacgct 
ga ^^ ta j ccaagtccatg ^^ 
a 5 c E aaggaa " AACTGGGACATCTCaagcaactat ^ 
agaggtacggaattaagcaacccttacaaaactgaaaaaaccgatgtcaaggatcaactcattgcgagatctgat 

GATAATATTGATCTAATATCCACACCAAGCAGACGTTGA 
T^IXJIHISYHPSLSAHHD^^ 

QKESQRlNWDlSSNYGSIFQFTREKPESRGTEl^NPYKTEKTDVKDQLlARSDDNIDUSTPSRkz 
ID30 879 bp



A J GA £ A £ AAGAATGG ^^ 
A 5 AGG 1I GGA 5 ACAAGGCTGAAGAAACGATAGC 

£59- GTCCCTC A A G CTG A AGTCG AATTG G AA AG CC AG C A AG AAG AG AAAATTG AAG CTCCTG AAG AC AGTG AA 

?^ A ^ A f^^™ AAG ^ AAAAGAAGGCAT 

AAA ^^ A ™ A ^ CTGA ^™ GCCAAGAAGCTOT 

£ AG 1^ T H^ 

^ A ^ C ^^^^OCJCATTCTCTTTTCTQCATCrTC L\ M 1 iLl 1 ' 1 AGTATCTATCACATCAAACATGCTTACTAT 
GGACATATAGCAAGCATTAACAGTCGCTTCCCTGAGCAGCTAGCTCCTTTAACTCTXrm 






TTTCTTTGCTrrCTTTGATAG^^ 

5 ^a^S^^ 

'"TSKIOTTHSYTAIUUU^ 
QEKDWTLDKVL<WYSQU^?PBf!^ 

ID105 990 bp

30 f^™™,,^^^ 



ID107 78 bp



A^ATATGTA^TGAAGCACGGAGGCAGCAG^CGTGCTGGGGATGCAGAGTGGGGGAGGGACGCTGCrArrr 

^ MICKMKQGGSRACWGWRVGEGRCYFN 



ID109 714 bp



VTAATGG 



45 c^aa^^ 

caggagttatcggaaaatcgcttg^c?atta^a^cg^aaagatt^ 

AGCTAGGCTATTGAACGTTTTrGATTrrGCTG^^ 

50 ATCCCCAGGCTGCCnTGATTTOAAACAGAT^ 

CTCATGTCCTAGaGGTGGCAGAGCaaGTCTGTGATC^ 
TAAGGTAGAGGACTTGAGGAAAGACCAC^ 

AGAGGAGGTTGCGGATGCGTCTCAAGGTCA^aA ^^^^^ GGaaa CTATCTACCTTAGTCTTGCTGGTAGAAA 
ID112 360 bp



ACTTGGCTAAAGAAACTACTGCTAGTCTCTTCaGTTG^TG'^^ 
TCAAGGCAACTTGCTCATCrmGAAA?G^ 

65 agcccgatagcgatagctgtatcttcttccccagtt^g^aacca^gttctact^ 






MALFSERGAVRCTPMASPIMRPMMVPT.EIKRVIPAP^ 
FEM VS MLALIWURLSFLRSMAIA VSSS P VLKPGSTZ 

ID 128 - 3.43 

ATGAAATTTAGTAAAAAATATATACCACCTOGATCAGCTGTTATCCTATC 
CTTGAGTCTATGTGCCTATGC ACT AA ACC AG C ATCGTTCGCAGG AAAAT A 

aggacaataatcgtgtctcttatgtggatggcagcca<gtcaagtcagaa^a 
agtgaaaacttgacaccagaccaggttagccagaaagaaggaattcaggc 

tgagcaaattgtaatcaaaattacagatca^cta^ 
gtg accactatcattactataatggg aaagttccttatga^ 
agtgaagaactcttgatgaaggatccaaactatcaacttaaagacgctga 
ta^gtc^tgaagtcaagggtggttatatcatcaaggtcgatggaaaat 

ATTATGTCTACCTGAAAGATGCAGCTCATGCTGATAATGTrCGAACTAAA 

gatgaaatcaatcgtcaaaaacaagaacatgtcaaagataatgagaaggt 

Ja^c^aatVtgctgtagcaaggtctcagggacgatatacgaca^tg 

atcgttatctctttaatccagctgatattatcgaagatacgggtaatgct 

tatatcgttcctcatggaggtcactatcactacattcccaaaagcgattt 

Itctgctagtgaattagcagcagctaaagcacatctggctggaaaa^ta 

tgcaaccgagtcagttaagctattcttcaacagctagtgacaataacacg 

caatctgtagcaaaaggatcaactagcaagccagcaaataaatctcaaaa 

TCTCCAGAGTCrrrrGAAGGAACTCTATGATTCACCTAGCGCCC^AACG^ 
ACAGTGAATCAGATGGCCTGGTCTTTGACCCTGCTAAGATT 

A^CCAAATGGAGTTGCGATTCCGCATGGCGA^ 

TTaCAGCAAGCTTTCTGCCTTAGAAGAAAAGATTGCCAGAATGGTGCCTA 

TCAGTGGAACTGG^ACAGTTTCTACAAATGCAAAACCTAATGAAGTA 

GTGTCTAGTCrAGGCAGTCTTTCAAGCAATCCTTCTC 
TAAGGAGCTCTCTrCAGCATCTGATGGTTATATTTTTAATCCAAAAGATA 

tc^ttga^aaacggctacagcttatattgtaagacatggtgatcatttc 

catTIca^c^a^atcaaatcaaattgggcaaccgactc^cc^caa 

tagtctagcaacaccttctccatctcttccaatcaatccaggaacttcac 

ItgTgaaacatgaagaagatggatacggatttgatgct^ 

gctg^agatgaatcaggttttgtcatgagtc^cggagac 

tttag 

MKFSKKYlAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVbGSQSSQK 
SE^^TPDQ^QKEGIQAEQWIKTTIXJGYVTSHGDHYHYYNGKVPYDALF 

IIellmTpnyqlkdad^nevkggyiikvdg^ 

DE^kSKOEHVKDNEKVNSNVAVA^QGRYTTNDGYVFNPADIIEDTGNA 

y^phgghyh^ipjSdi^aselaaakahlagknmqpsqi^ysstasdnnt 

tpngvaj^gdhyhfipysklsaleekiarmvpisgtgstvstnakpnev 
vsslgslssnpsslttskelssasdgyifnpkdiveetatayivrhgdhf 
hyipksnqigqptlpnnslatpspslpinpgtshekheedgygfdanrii 
aedesgfvmshgdhnhyffkkdlteeqikvrkni* 
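
Each entry in TABLE 1 pairs a nucleotide sequence with its predicted protein, so legible stretches can be cross-checked by translation. The sketch below assumes the standard genetic code and reading frame 1; the function name `translate` and its `stop_symbol` parameter are illustrative. The usage line takes the first 42 legible bases of ID19 (including the stray spaces in the printed listing) and reproduces the start of the printed ID19 protein.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str, stop_symbol: str = "*") -> str:
    """Translate frame 1 of a DNA string, stopping at the first stop codon.

    Whitespace is ignored; codons containing unreadable characters become X.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")
        if residue == "*":
            protein.append(stop_symbol)
            break
        protein.append(residue)
    return "".join(protein)


# First 42 legible bases of ID19 as printed, spaces included.
id19_start = translate("ATGGTAAAAGTAGCAGTTATATTAGCTC AGGG CTTTGAAGAA")
print(id19_start)
```

Several printed proteins appear to end in Z or 2 where a stop would be expected; passing `stop_symbol="Z"` mimics that convention, though whether the original listing used it is an inference from the OCR text.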




TABLE 2 



CGGTrrTGAACGACC^GTO G G ^ r C r A r ^ 

ID3 6360 bp

^^^^ 

GATGCCAAAGCCTCAiTrArT^^^ 

a tccctca¥gc^ 

ACCCACATGGGATGCAT^^Ar^ 

'OATAAAirGUSS^ACTScS^S 0 ^^ 
CATGGTAAGCAAATTGTVrcA^A^r^ CACA ^ AAA ACT^ 







*cacag 






ACAGCCCTACGTGAATTAGGAGGTCAATTATCCTCTATT^ 

TTAATCCAATTGAGTCVVATAAAATGAATGATATCGAACCAAATGAGGTGAATAGAGATTTTCCGTTATATGAAG 
C A A A A A A T ATTT G CT AT AAGT ATGG AG AT A AAG A AAT ATT A A A A A ACIT AA ATTTTTGTTTTC AA CGT AATG AAAA 
GTATTTAATTTTAGGTGAAAGTGGAAGCGGGAAATCTACATTATTAAAA 
D AGTGGAGAATTGCGATTC TGCGG GGATGATATAAAAAAAACCTCCTATrrAAATATGGTTTCGAATGTTCTATATG 
TAG ATC AAA A AGCTT ATTTGTTTG A AG GT A CG ATT AG AG AT A AT ATTTT ATTGG AAG A AAATT AT A CTG ATG AAG A 
AATACTACaGTCTTTAGAGCAAGTTGGTTTGAGTGTAAAAGATTTTCCTAATAACATTTTAGATTATTA 
GATGATGGGaGATTACTGTCAGGAGGGCAGAAACAAAAAATTACTTTAGCTAGAGGGCTAATTAGAAATAAGAA 

aatagtattaattgacgagggaacttctgctatcgataggagaacttcgttagcgattgaacctaagatattaga 

1U TAGAG AG G ATTTG ACTOTCATTATTGTTACCCATGCTCCGCATCCGGAACTTAAACAATATrTTACT 

caatt ice aaaggattttatttaa 

MYTIIKSNIKKFSLLTnnVAGQLLLIYAATINALVLNELUMN^^ 

EIRNRVATDISNSTYQEFHSKSSGTYl^WLNNDVQTLNDQAFKQLFLVIKGISGTIFAVVTLNHYHWSLTVATLFSLMIM 
15 LLVPKIFASKMREVSLNLTNQNEAFLKSSEmNGFDVIj\S^^ 

FFQISLVFLTGYI^IKGrVKIGTIEAIGALTGVIFTALGELGGQL^SIIGTKPIFlJCLYSINn'IESNKMNDIEPNEVNRDFPL^ 
AXNICYKYGDKEILKNLNFCFQRNEKYLILGESGSGKSTLLKLLNGFLRDYSGELRFCGDDIKKTSYLNMVSNVLYVDQ 

kaylfegtirdnilleenytdeeilqsi^qvglsvk;dfpnnildyyvgddgri.lsggqkqkitlarglirnkkivlidegt 
saidrrtslaierkildredltviivthaphpelkqyftkiyqfpkdfi2 



ID9 705 bp



ATAACAGTTAAACAGATTATGGACGAAATAGCCGTTTCAGATATGACTGCAAGGCGCTATTTACAGGAATTAGCT 
GATAAAGATTTGCTGATTCGTGTGCATGGTGGAGCTGAAAAACTrTCGAACCAACrrCCCTTTTGACTAATGAGC 
ZD C AAAT ATTG A A AAACAAG CCCTC C A AACG G CA G AAAAAC AAG AA AT AG CC C ATTTTGC AGG C AGTCT AGT AG AA 

GAAAGAGAAACTATTITCATTGGACCAGGAACAACATTAGAGTTTTTTGCGCGTGAGTTGCCTATTGACAATATC^ 
GCGTCGTAACCAACAGTCTACCTGTTTTTCTGATTTTA^ 

AAATTATCGCGATATTACAGGTGL1 I I"I G'rrGGTACATTGACCCTACAAAATCTCTCTAATCTCCAATTTTCTAAA 
GCTTTCGTTAGCTGTAATGGTATTCAAAACGGAGCTCTAGCTACTTTTAGCGAGGAAGAGGGAGA 

ATCGCTTT AAAT AATTCT AAT AAAAAATATTTACTCGCAGATCATAGCAAGTTCAATAAGTTTGATTTITA^ 

TTATAATGTATCAAATCTTGATACTATTGTTT^ 

ACATTAAAGTCATCAAGCCTTAA 



rrVKQIMDEUVSDMTARRYLQELADKDLLIRVHGGAEiCLRTNSLLTNERSNIEKQALQTAEKOEIAHFAGSLVEERETI 
FIGPGTTLEFFARELPIDNIRVVTNSLPVFLILSERKLTDLILIGGNYRDITGAFVGTLTLQNLSNLQFSKAFVSCNGIQNGA 
LATFSEEEGEAQRlALNNSNKKYLLADHSKFNKFDFYTFYNVSNLDTIVSDSIOSDStLFKLSICHIKVIKPZ 

ID10 483 bp

40 ATGACTGACTTTTCGTTAGATCTTCrrCTAGAAGCC 

AG CT A G A C AA AA C AG AT AA AG ACC A AG AG CTT AA AACTG AA ATTC AATCC ATCTTT AT CG AACA C AAGG G AAATT 
ATGCTTATCGCCGGGTTCATTTAGAACTAAGAAATCGTGGTTATCTGGTAAATCATAAAAGAGTTCAAGGCTTGaT 
GAAAGTACTCAATTTACAAGCTAAAATGCGAAAGAAACGAAAATATTCTTCTCATAAAGGAGACGTTGGTAAGAA 
GGCAGAGAATCTCATTCAAGCCCAATTTGAAGGCTCTAAAACAATGGAAAAGTGCTACACAGATGTGACTGAATT 

4D TGCCATTCCAGCAAGTACTCAAAAGCTTTACTTATCACCAGTTTTAGATGGC^ 
AATL1 J J L i'l GTTCGCCTAATTTAG AATAA 

MTEFSLDLLLEAIKLAR^TTYYYHLKQLDKTD^ 
50 ^££Q AKM RK K RK YSSHK CDVG^ 



ID14 1266 bp

ccaggatttggtaccgttgcaagtggtgtgcctitcctcctaaa 

CA TTCA GATATCAAACTITGCTAAGGTATTGGTCAAGGATGAAGATGAAAAAAATCGCTTGCTTGCAGCAGGGAAT 

GACTTTAACTTTGTAACCAATGTGGATG AT ATTTT ATCAGACCAGGAT ATT ACTATCGTAGTGGAATTGATGGGGC 

GTATrGAGCCTGCTAAAACCTTTATCACTCGTGCCTTGGAAGCTGGAAAACACGTrGTTACTGCT 

TTTAGCTGTCCATGGCGCAGAATTGCTAGAAATCGCTCAAGCTAACAAGGTAGCACTTTACTACGAAGCAGCAGT 

TGCTGGTGGGATTCCAATTCTTCGTACTTTAGCAAATTCCTTGGCTTCTGATAAAATTACGCGCGTGC^ 

GTCAACGGAACTTCCAACTrCATGGTGACCAAGATGGTGGAAGAAGGCTGGTCTTACGATGATGCTCTTGCGGAA 

G^AACGTCTAGGAT TTGCA GAAAGCGATCCGACGAATGACGTAGATGGGATTGATG<^GCCTA(^GATGGTT 

ATTTTGAGCCAATTrGCCITTGGCATGAAGATTGCCTTTGATGATGTAGCCCAC^ 

CAGAAGACGTAGCTGTAGCTCAAGAGCTTGCTTACGTAGTGAAATTGGTTGGTTCTATTGAGGAAACTTCT^ 

TATTGCTGCAGAAGTGACTCCAACCTTCCTACCTAAAGCGCACCCACrTGCTAGTGTGAATGGCGTAATGAACGCT 
GTCTITGTAGAATCTATCGGTATTGGTGACTCTATGTACT^ 




AGTGTTGTAGCTGATATTGTCCGTATCGTTCGTCGTTTGAATGA 

GCCGTGACTTGGTCTTGGCAAATCCTGAAGATGTCAAAGCAAACTACTATTTCTCAATCTTGGCT 
AGGTCAGGTCXrGAAGTTGGCrGAAATCITCAATGCTCAAG 

GAGGGTGAC AAGG CGCGTGTCGTTATCATCACaCACAAGATTAATAAAGCCCAGCTTGAAAATGTCTCAGCTGAA 
5 TTGAAGAAGGTTTCAGAATTCGACCTCTTGAATACCTTCAAGGTGCTAGGAGAATAA 

P GFGTV ASGVPFlXKENGGKINQSAiiSDIKVAKVLVKDEDEKNRJLL^ 
AKTF^'RALEAGKHVVTA^^CDLLAVHGAELLE1^ 

MVTKMVEEGWSYDDALABAQRLGFAKDPTNDVDGIDAAYKMVIl^QFAFGMKlAFDDVAHKGlRNnTEDVAV 
10 LGYWKLVGSIEETSSGIAAEVTITTLPKAHPLASVNGVMNAVFVE^ 
TOGTIGKDFNEYSRDLVI^NPEDVKANYYFSILALDSKGQVLK^ 
QLENVSAELKKVSEFDLLNTFKVLGE2 






ID16 1725 bp



AT GAAACA CCTATTATCTTACTTCAAACCCTACATCAAGGAATCAATTTTAGCCCCCTTGTTCAAGCTGTTAGAAG 
CToi i i i iGAGCTCTTGGTTCCCATGGTGA TTGC TGGGATTGTTGACCAATCTTTACCTCAGGGAGATCAAGGTCA 
TCTCTGGATGCAGATTGGCCTGCTCCTTATCTTTGCAGTAATTC 

CAGC AAAGGC AG CAGTAGGTTCTG CT A AGG AATTG AC A A ACG ATCTTT ATCGTCAT ATTCTTTCCTTGCCCAAGG A 
20 CAGCAGAGACCGTCTG ACAACTT CTAGTTTGGTCACTCGCTTGACTTCGGATACCTACCAGATTCAGACT 
AATCAATT CCTG CGTCTCTTTTTACGAGCGCCCA TTATC GTTTTTGGTGCCATTT^ 
TGAG TTGACTTTCTGGTTCTTAGTCTTGGTTGCCATTTTGACCATTGTCATTCT 
' CTTTCTACA GTAGT CTCAGAAAGAAAACGGACCAACTGGTTCAGGAAACGCGCCAGCAATTGCAAGGGATGCGGG 
TTATTCGTGCTTTTGGTCAAGAAAAACGAGAGTTACAGATTTTTCAAACCCTTAACCAAGTTTATGCTA 
25 AGAAAAGACACGTTTCTGGTCTAGTTTATTAACACCTCrGACCTAT 

ATCTGG CAAG GCTATATTTCAATTCAAGGAGGAGTGCTCAGTCAAGGTGCTCTCATTGCTCTTAT 
TACAGATTTTGG TGGA ATTGGTCAAGCTAGCCATGTTGATCAATTCCCTCAACCAGTCCTATATCTCAGTCAAGCG 
A ATCGA GGAAGTCTTTGTTGAGGC TCCA GAGGATATCCATTCAGAGTTAGAACAAAAGCAAGCTACCAGAGATAA 
GCnTTTACAAGTCCAAGAATTGACCTTTACCTATCCTGATGCGGCCCAGCCTTCTCT 
30 ATGACTCAAGGACAAATTCTAGCrrATCATCGGGGGAACTGGTTCTGGTAAATCAAGCrTGGTGCAACT 

GACTTTATCCAGTAGACAAGGGGAmACATTGACCTTTATCAAAATGGACGTAGTCCTCTTAATTTGGAGCAGTGGC 

GGTCTTGGATTGCCTATGTACCTCAAAAGGTCGAACTCTTTAAAGGAACCATTCGTTCCAACTT 

CAATCAAGAAGTATCTGACCAGGAACTCTGGCAGGCCTTGGAGATTGCGCAAGCT 

GGAAGGACTCTTGGATGCTCTAGTTGAGGCAGGGGGGCGAAATTTCTCAGGTGGACAAAAACAAAGATTGTCTAT 
35 CGCCCGAGCAGTCTTGCGCCAGGCTCCGT TTCT CATCCTAGATGATGCAACCTCGGCACTGGATACCATTACAGAG 
TCCAAGCTCTTGAAAGCTATTAGAGAAAATTTTCCAAACACGAGCTTAATTTTGATCT 

TACAGATGGCGGACCAGATTCTCCTCrrGGAAAAAGGTGAGTTGCTAGCTGTTGGCAAGCACGATGACTTGATGA 
AATCCAGCCAAGTCTATTGTGAAATCAATGCATCCCAACATGGAAAGGAGGACTAG 



MKHLLSYFKPYIKESII^PLFKLLEAVFELLVPMVlAGrVDQS^ 
AVGSAKELTNDLYRHIl^LPKDSRJDRI/TTSSLVTRLTSDTYQIQTC 

VAILTIVIVGl^RLVNPFYSSLRKKTDQLVOETRQQLQGMRVIRAFGOEKRELQIFQTLNQVYARLQEKTGFWSSLLTPL 
TYLIVNGTLLVIIWQGYlSKX^Vt^QGALlAUNYL^^ 

ATRDKVLQVQELTFTYPDAAQPSLRYISFDMTQGQILGIIGGTGSGKSSLVQLLLGLYPVDKGNIDLYQNGRSPLNLEQ 

WRSWIAYVPQKVELFKGTIRSNLTLGFNOEVSDQELWQALEIAQAKDFVSEKEGLLDALVEAGGRNFSGGQKQRLSIA 

RAVLROAPFLILDDATSALDTrTESKLUCAlRENFPOTSLILlSQRTSTLQMADQILXLEKGELLAVGKHDDLMKSSOVYC 
EINASQHGKEDZ 

ID18 1224 bp



ATGAAACGTTCTCTCGACTCAAGAGTCGATTACAGTTTGCTCTTGCCAGTATnTTTCT 

GGCTATCTATATAGCCGTTA GTCA TGATTATCCCAATAATATTCTGCCCATTTTAGGGCAGCAGGTCGCCTGGATT 
GCC^GGGCTTGTGATTGGTTTTGTGGTCATGCTCTTTAATACAGAA l 1 Hi i 1 GGAAGGTGACCCCCTTTCTATA 
TATTrTAGCCTTGGGACTTATGATCTTGCCGATTGTATT^ 

AACTGGGTATCAATAAATGGAATTACCCTATTCCAACCGTCAGAATTTATGAAGATATCCTATATCCTCATGTTGG 
CTCGTGTCATTG TCCA ATTTACAAAGAAACATAAGGAATGGAGACGCACGGTTCCGCTGGA C1 1 i 1 IGl ' l AATTTT 
CTGGATGATTCTCTTTACCATTCCAGTCCTAGTTCTTTTA 
TAGCCATTTTCTCAGGAATCGTTTTATT^CAGGGGTTTCTTG 

ACAGGAGTTGCTGG l'l'l C l'l AG CT ATCTTT ATTAGCAAGGACGGACGAG C1 ' IT fC ri CACCAGATTGGAATGCCGA 
OU CCTACCAAATTAATCGGATTTTGGCrTGGCTCAATCCCTTTGAGTTTGCCCAAACAA^ 

agggcagattgcc attgg gagtggtggctratttggtcagggatttaatgcitcgaatctgc^ 
gactcagatatgatttttacggttattgcagaagattttggctttattggctct 
catgttgatttaccgtatgttgaagattactcttaaatcaaataaccagttctacacttatat^ 
t tatg atgttgctcttccacatctttgagaatatcggtgctgtgactggactacttc 
cctttcatttcgcaagggggatcagctattatcagtaatctgattggtgttggtttgctt^ 






GACTAATCTAGCTGAAGAAAAGACCCGAAAAGTCCCATTCAAACGGAAAAAGGTTGTATTAAAACAAATTAAATA 
A 

MKRSLDSRVDYSLLLPVFFLLVIGWAIYIAVSHD^^ 

D GLMILPrVFYNPSLVASTGAKNWVSINGrTLFQPSEFMKISYILML^RVIVQFTKKHKEWRRTVPL^ 
VLLALQSDLGTALVFVAIFSGIVLUJGVSWKI^ 

AQTTTYQQAQGQiAIGSGGLFGQGFNASNLLIPVRESDMIF™^ 
ISTGLIMMLLFfflFENIGAVTGLLPLTGIPLPFISQCGSAllS^ 

ID22 987 bp

ATGGTCGCTAAGAAAAAAATCTTATTTTTTATGTC 

CGA ™^ CAAATCTGGATC ^^ 

,c ^I^GTTCCA^ 

13 TGGAGAATGAGAATTTATTTTCCAAGACTGACTCGTCGTTTGCITCTAAAAG 

TTACCATTATGAATCCACCaCTGTTGTTCTCTAAAj\GAAGAGAAGTCAAGAj\GATATCTTGGATTCATGGAAGTAT 
TGAAGAACTTCTTAAGGATAGCTCTAAAAGAGAATCACATAGAAGCCAGTTGGATGCTGCGAATACAATTGTAGG 
5 ti^TJ C AA A A AAG ACC A G C A ATTCT ATC A AG G A AGTTT ATCC AG ATT AT ACTTCT A A ATT AC AG AC A ATCT ACAAT 
^-Z^?^ A ]EX3T^S^5 A ^^*^"'~^^^~ i ^'^ A A A A ATCTC AAG A G AAG A.XCG AT AXCG AG ATTG CTCCTCA AAGT ATCTCX A 
CTAT<XGACGGATTGAGGA^ATAAGGGTTCTGACCGTOTAGTGGAAGTGATACG 

-^AACTATCATCTCTATTTTATCGGGGCTGGTGATATGGAAGAGGAACTGAAAAAACGAGTCAAAGAGTATGGGA 
TTG AGG ACTATGTAC ATTTCCTTGGTTATC A AA A A AATCCTT ATC AGT ATCTATCTC AG ACG AAA G TT CI 1 ' 1 ' 1 GTCT 
ATGTCTAAACAAGAAGGTTTTCCTGGAGTGTATGTGGAGGCCrTGAGTCTGGGACrCCCTTTTATCTCT 
,c ^ GGAGGGGCTGA GGAATTATCCCAAGAAGGACGATTTGGACAAATCATrG 
/D ^ GA " ACTAA " ACATGAOTCTGC CT<=AAACTTTGATG 

TTACAAAACAAATCGAACAAGTAGAAAAACTATTAGAGGAGTAG 

MVAKJCKIU=FMWSFSLGGGAEKILSTIVSNLDPEKYDID!LEMEHFDKGYESVPKHVimJCSLODYROT^ 
RIYFPRLTRRLLVKDDYDVEVSFnMNPPLLFSKWlEVKKlSWIHGSIEELLJCDSSKRESHRSQL^ 
30 EVYPDYTSKLQTIYNGYDFQTILEKSQEKIDIEIAPQSICTIGIUEENKGS 

KKRVKEYGIEDYVHFLGYQKNPYQYI^QTKVLl^MSKQEGFPGVYVEAi^LGLPnSTOVGGAEELSOEGRFGOnESNO 
EAAQ A ITN Y MTS AS NFD VDEASQFIQQFTITKQIEQ VEKLLEEZ V V W 

ID23 1434 bp

A I GGAAA 5I G S A ^ AA ^ AGTGTGA ^ GTG ^ 

^CAGAAGCAG^CTATCAAAATCrGGAAATTATTCTTGTTGATGATGGTC 

tgattcaatcgctgaacaagatgacagggtgtcagtgcttcataaaaagaacgaaggattgtcgcaagcacgaaa 

TGATGGGATGAAGCAGGCTCACGGGGATTATCTGATTTT^ 

4U CAGAGCTTATATGAGCAATTAGTrCAAGAAGATGCGGATGTTTCGAGCTGTGGTGTCATGAATGTCTATGCTAATG 
A I GAAAGCCCACAGTCAGCCAATCAGGATGACTA *n^ 

AGGTGAAAAAATACCTGGGACGATTTGCAATAAGCTAATCAAGAGACAGATTGCAACTGCCCTATCCTTTCCTAA 
555?"^*^* ATTT ACG A AG ATG CCT ATT ACC ATTTTG ATTX A ATC A A GTTGGCC A AG A AGT ATGTGGTT AATA CTAAA 
CGC I A P A ^ ACTA ^ C CATAGAGGGGATAGTATTACGACCAAACCCTATGCAGAGAAGGAm 
« ATATCTACCAAAAGTTTTATAATGAAGTTGTGAAAAACTATCCTGACTTGA 

^^ISSSS^ 1 1 1 A ^ G ^ GGA ^ AAGA ^ G ^GCTAGATGATCAGTATAAACAGTTTGAAGCCTATTCTCAGATT 

$ A i GG irnzL AAAAGGccATGc 

TGGCCCTATTCATAAATATTrCCTTATATCGATTCTTATTACTGAAAAATATTGAAAAATCTAAAA 

50 

KQAHGDYLIFIDSDDYIHPEMIQSLYEQLVQEDADVSSCGVMNVYANDESPQSANQDDYFVCDSOTFLKEYLIG^PG 
TlCNKLIKRQIATALSFPKGLIYEDAYYHFDLlKLAKKYVV^^ 

NIEJCSKKL.HZ 



ID24 735 bp



A I2 A 5 AATCAAAGAGAA J^ CAATAATA ^ AATGGA GGAATA^ 
A TI^ GA ^ GATATAAA ^^ 

5^ JT A AA ^ 

ATTTAATCGAAGAACCAAAATTATTTrTATCTAAAACAGGTTTAGAGAATITAAAATAT^ 
! G ]7 GA i7/£^ 

AA G A CCT ATTCTTTG GGT AC A A A ACAAAAATT A G CTTTGCTTCT A ACTCTCGTT A CGG AAC CTG AT AT ATTG ATTT 






TAGATGAACCGACTAATCGTTTAGATATTGAATCATCACAAATAGTTTTAGCCCTTCTA 

TGAAAATGT GGGAA TTTTAATATCGAGTCATAAATrAGAAGACATTG.\AGAAATTTGTGAGAGA Ql I ' Jl 1 li rTG 
GAGAACGGGC 1 1 1 II G ACAT TTCAAAAAGTAGG AAAAGATAGTCATAA II 1 C l ' l O Vl ' l GAG AT AG CTTTTTC ATC AG 
^ CTACAGATAGAGACATTTTCATTACCAAACAAGAATTTTGGGATATTGTTTAG 

MRIKEKTNNINGGIKWSKHYGHSIIUCDI^ 

KLFLSkTGI^NLKYLSNLYGVDYNQERFRCLIQEUDLTQStN^ 

SSQrVLAVUCKI^HEWGILISSHKl^DffiEICERVLFLENG^ 

ID25 1704 bp

ATGACTGAATTAGATAAACGTCACCGCACT 

GTGCGACTGGTATGACAGATAAGGACTTTGAAACATCGATTGTGGGAGTGATTTCGACTTGGGCGGAAAATACAC 
CATGTAACATTCACTTGCATGATTTCGGGAAACTGGCTAAAGAAGGTGTCAAATCTGCAGGCGCTTGGCCTGTAC 

10 AGTTTGGAACCATTACCGTAGCGGACGGGATCGCTATGGGAACGCCTGGTATGCGTTTCTCTCTAACATCTCGTGA 
CATCATCGCGGACTCCATCGAGGCGGCTATGAGTGGTCACAACGTGGATGCCTTCGTCGCTATCGGTGGCTGTGA 
CAAGAACATGCCTGGATCTATGATTGCTATTGCTAATATGGATATCCCAGCTATTrrCGCCTATGGTGGA^ 
GCACCGGGAAATCTTGATGGTAAAGATATCGACTTGGTTTCTGTCTTTGAAGGTATCGGAAAATGGAAC 
GACATGACAGCTGAGGACGTGAAACGTCTTGAATGTAATGCCTGCCCTGGCCCrGGTGGTTGTGGTGGTATGTAT 

ZU ACTGCTAATACCATGGCAACTGCTATCGAAGTTCTAGGGATGAGTTTGCCAGGGTCATCCTCrCACCCAGCTGAAT 
CAGCTGATAAGAAAGAAGATA TCGAA GCAGCAGGACGTGCTGTTGTTAAGATGTTGGAACTTGGTCTCAAACCAT 
CAGATATCTTGACTCGTGAAGCCTTTGAAGATGCTATCACTGTAACGATGGCTCTCGGTGGTTCTACAAACGCCAC 
TCTrCACTTGCTCGCCATTGCCCATGCCGCAAATGTTGACTTGTCACTTGAGGACTTCAATACGATTCAAGAACCT 
GTGCCTCACTTGGCCGACTTGAAACCATCTGGTCAGTATGTCTTCCAAGACCTCTACGAAGTCGGTGGTGTCCCTG 
CGGTTATGAAGT ATTT GTTGGCAAATGGTTTCCTTCACGGAGATCGCATCACATGTACTGGTAAGACTGTAGCTGA 
AAACTTGGCTGACTTTGCAGACTTGACTCCAGGCCAAAAAGTTATCATGCCACrTGAAAATC^ 
TGGTCCGCTTATCATCTTGAACGGGAACCrTGCTCCTGACGGTGCAGTTGCCAAGGTATCAGGTGTTAAAGTGCGT 

cgtcacgttgggccagctaagg tcttt gactcagaagaagatgcgattcaggccgttctgacagatgaaatcgt^ 

GATGGCGATGTAGTCGTTGTTCGTTTTCrrTGGACCTAAAGGTGGTCCrGCrrATGCCTGAGATC 
-5U AATGATTGTTGGTAAAGGTCAGGGAGATAAGGTGGCCCTCTTGACGGACGGACGTTTCTCTGGTGGTACTTATGGT 
CTGGTTGTTGGACATATCGCTCCTGAAGCTCAGGATGGTGGACCAATTGCCTATCTCCGTACCGGCGATATCGTTA 

cggttgaccaagataccaaagaaatttctatggccgtatccgaagaagaacttgaaaaacgcaaggcagaaaca 
accttgccaccactttacagccgtggtgtcctcggtaaatatgcccacatcgtatcatctgcttcacgcggagccg 
^ tgacagacttctggaatatggacaagtcaggtaaaaaataa 

MTELDKRHRSSIYDSMVKSPNRAMLRATGNrTDKDFETSIVGVISTW 

TrrVADGlAMGTPGMRFSLTSRDHADSIEAAMSGHNVDAFVAlGGCDKNMPGSMlAlANMDIPAIFAYGGTlAPGNLTC 
KDIDLVSVFEGIGKWNHGDMTAEDVKRLECNACPGPGGCGGMYTANTrMATAIEVLGMSLPGSSSHPAESADKKEDIE 
AAGRAVVKMLELGLKPSDILTR£AFEDArr\n*MALGGSTNAT^ 

40 YVFQDLYEVGGVPAVMKYU^NGFLHGDRrrCrGKTVAENLADFADLTPGQKVIMPLENPKRADGPLIILNGNl^PDG 

AVAKVSGVKVRRHVGPAKVFDSEEDAIQAVLTDEIVDGDVWVRFVGPKGGPGMPEMLSLSSMIVGKGQGDKVALLT 

DGRFSGGTYGLVVGHUPEAQDGGPUYLRTGDIVTVDQDTKE1SMAVSEEELEKRKAETTLPPLYSRGVLGKYAHIVSS 
ASRGAVTDFWNMDKSGKK2 

ID26 274 bp

ATGTTATAATAAAAATAAAGAATTTAAGGAGAAATACAATATGTCAATTTTTATTGGAGGAGCATGGCCATATGC 
AAACGGTTCGT TACAT ATT GGTCA CGCGGCAGCGCTTTTACCGGGGGATATTCTTGCAAGATACTATCGTCAGAA 
QGGAGAGGAAGTTTTATATGTTrCTGGAAGTGATTGTAATGGA 
DU TAAGTCTGTGAAAGAAATTGCTGATTTTTATCATAAGGAATTTAATCCA 

ID28 1065 bp



atgacaacattattttcaaaaattaaagaagtaacagaacttgctgcagtctcaggtcatgaagcgcctgtccgt 
gcttatcttcgtgaaaagttgacaccgcatgtggatgaagtggtgacagatggcttgggtggtatttttggtatca 
aacattcagaagctgtggatgcaccgcgcgtcttggtcgcttctcatatggacgaagttggttttatggtcagcga 
aatcaagccagatggtaccttccgtgtcgtagaaatcggtggctggaaccccatggtggttagcagccaacgttt 

CA^CTC^GACTCGTGATGCrrCATGAAATTCCTGTGArrrCAGGTTCT 

gggggaccaaccatgccagccattgccgatatcgtttttgatggtggttttgcggac^ 
tttggcatccgtcctggtgataccattgtaccagatagttctgcaattttgacagccaatgaaaaaaatatcatct 
caaaagcttgggataaccgctacggtgtcctcatggtaagcgagctagctgaagctttatcgggtcaaaaact 
od gcaatgaactctatctgggttctaacgtccaagaagaagttggtctgcgtggcgctcatacctctacaaccaag^ 
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tgacccagaagtct tcct cgcagttgattgctcaccagcagctgatctctacggtggtcaaggcaagattggaga 
tggaaccttgattcgtttctatgatccaggtcacttgcttctcccaggga 

gaagaagctggtatcaagtaccaatactactgtggtaaaggcggaacagatgcaggtgcagctcatctgaaaaat 
ggtggtgtcccatcaaca acta tcggtgtctgcgctcgttatatccattctcaccaaaccctctatgcaatggatg 

ACTTCCTAGAAGCGCAAGCi i l I tACAAGCCTTGGTGAAGAAATTGG ATCGTTCAACGGTTGAnTGATTAAACA 
TTATTAA 

N^LFSKJKEVTEI^VSGHEAPVRAYLREKLTPHVDEVVTDGLGGIFGIKHSEAVDAPRVLV 

IX3TrevVEIGGWNPMWSSQRFKLLTRIX3HEIPVlSGSVPPHLTRGKGGPTMPAIADIW^ 

IVPDSSAn-TANEKNIlSKAWDNRYGVLMVSEI^EAI^GQ 

PAGDVYCGQGKJGDGTURFn>PGHUXPGMKDFLLTTAEEAGIKYQ 

IHSHQTLYAMDDFLEAQAFLQALVKKLDRSTVDUKHYZ 

ID31 1182 bp



ATGGAATTTTCTATGAAATCAGTCAAAGGACTACTCTTrATCATAGCTAGTTTTATCTTG 
GAACACTTCTCCCCAATTCATGATTCCAGGACTAGCTTTAACAAGCCTATCTCTGACTTTTATC 
CTCCCACTACTAGAAAGCTGG TTTCA CAGTTTGGAGAAGGTCTACACCGTCCACAAATTCACAGCCTTTCTCTCAA 
TCATCCTACTAATCTTTCATAACTTTAGTATGGGCGGTTTGTGGGGCTCTCGCTTAGCT^ 
20 GCCATCTATATCTTTGCCAGCATCATCCTTGTCGCCTATTTAGGCAAATACATCCAATACGAAGCTTGGCGATGGA 
TTCACCGCCTGGTTTACCTAGCCTATATTTTAGGACTCTTTCACATCTACATGATAATGGGCAATCGTCTCCTTACA 
TTTAATCTTCTAACl 1 1 I CI 1C1 1 GGTAGCTATGCCCTTTTAGGCTTACTAGCTGGTTTITATATCATTTTTCTATAT 
C A AA A G ATTTCCTTCC CCTATCT AG GG AAA ATT A CCC ATCTC AAACG CTT AA ATC ACG AT ACT AG A G A AATTC A A 
ATCCATCTTAGCAGACCTTTCAACTATCAATCAGGACAATTTGCCTTTCTAAAGATTTTCCAAGAAGGC 
25 GTGCTCCGCATCCCTTTTCTATCTCAGGAGGTCATGGTCAAACTCTTTACTTTACTGTTAAA^ 

TACCAAGAATATCTATGATAAT CTTCA AGCCGGCAGCAAAGTAACCCTAGACAGAGCTTACGGACACATGATCAT 
AGAAGAAGGACGAGAAAATCAGGTTTGGATTGCTGGAGGTATTGGGATCACCCCCTTC 
ACATCCTATTTTAGATAAACAGGTTCACTTCTACTATAGCTTCCGTGGAGATGAAAATGCAGTCTA 
CTCCGTAACTATGCTCAGAAAAATCCTAATTTTGAACTCCATCTAATCGACAGTACGAAAGACGGCTATCTTAATT 
TTGAACAAAAAGAAGTGCCCGAACATGCAACCGTCTATATGTGTGGTCCTATTTCTATGATGAAGGCACTTGCCA 
AA C AG ATT A AG AA AC AA AATCCAAA A AC AG AG C AT ATTT AC 



MEFSMKSVKGLLniASnLTLLTVmOTSPQFMIPG^ 
NFSMGGLWGSRLAAQFCNI^IYIFASIILVAYLGKYIQYEAWRWIH^^ 
J5 YALLGLLAGFYIIFLYQKISFPYLGKITHUCRLNHDTREIQIHLSRPFNYQSGQFAFLKIFQEGFESAPHPFS1SGGHGOTLY 
FTVKTSGDHTKNIYDNLQAGSKVTLDRAYGHMIIEEGRENQVWIAGGIGITPRSYIREHPILDKQVHFYYSFRGDENAV 
YLDLLRNYAQKNPNFELHLIDSTKDGYLNFEQKEVPEHATVYMCGPISMMKALAKQIKKQNPKTEHIY 



ID32 900 bp

ATGACi i I 1 AAATCAGGCTTTGTAGCCATTTTAGGACGTCCCAATGTTGGGAAGTCAAC 

TGGGGCAAAAGATTGCCATCATGAGTGACAAGGCGCAGACAACGCGCAATAAAATCATGGGAATTTACACGACTG 
ATAAGGAGCAAATTGTCTTTATCGACACACC\GGGATrCACAAGCCTAAAACAGCTCTCGGAGArrrCATGGTTG 
A* ^5 T ^ GCCTACAGTACCmCG CGAAGTGGACACTGTTCITITC 

45 GGACGATATGATTATCGAGCGTCTCAAGGCTGCCAAGGTTCCrGTGATTTTGGTGGTGAATAAAATCGATAAGGTC 
CATCCAGACCAGCrCTTGTCTCAGATTGATGACTTCCGTAATCAAATGGACTTTAAGGAAATTGTTCCAATCT 
CCCTTCAGGGAAATAACGTGTCTCGTCTAGTGGATATTTTGAGTGAAAATCrGGATGAAGGTTTCCAATATTTCCC 
GTCTGATCAAATCACAGACCATCCAGAACGTTTCTTGGTTTCAGAAATGGTTCGCGAGAAAGTCTTGCACCr 

en CG T GA ^ GAGA " CCGC ^^ 

5U ATCCGTGCAACCATCATGGTCGAGCGCGATAGCCAAAAAGGGATTATCATCGGTAAAGGTGGCGCTATGCTTAAG 
AAAATCGGTAGCATGGCCCGTCGTGATATCGAj\CTCATGCTAGGAGACAAGGTCTTCCTAGAA 
GTC A AG AA AA A CTGG CG CG AT A AA AA G CT AG ATTTGG CTG ACTTTGG CT AT AATG A AAG AG A AT ACT AA 

MTFKSGFVAILGRPWGKSTFLNHVMGQKIAIMSDKAQTTRNKIMGIYTTDKEQIVFIDTPGI^ 
TLREVDTVLFMVPADEARGKGDDMIIERLKAAKVPVILVVNKIDKVHPDQL^

RLVDII^ENl^EGFQYFPSDQrTDHPEI^LVSE^REKVLHLTREEIPHSVAVVVDSMKRDEETDKVHIRATIMVE^ 
KGniGKGGAMUOCIGSMARRDIELMLGDKVFLETWVKVKKNWRDKKLDLADFGYNEREYZ 

ID33 855bp

CTGCl I CI ION 1 1 1 acagaaggaggacttatgcctgaattacctgaggttgaaaccgtttgtcgtggcttagaaa 

AA "^" G ATT AT A GG AAAG AAG ATTTCG AGT AT AG AAATTCG CT A CCCC AAG ATG ATT AAG A CG G ATTTGG AAG AGT 
TTCAAj\GGGAATTGCCTAGTCAGATTATCGAGTCAATGGGACGTCGTGGAAAATATTTGCTTTTTTATCTG 

GCCCATGTTTTCTTTCATTTTGAAGATGGTGGCACCCTTGTT^
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tcttggtcc ctga clll 1 1 ag acgtct actttatttctaaaaaatta 
tttacaggtctttcaatctgccctrgccaagtccaaaaagcctatcaaatcc^ 

gctggacttggcaatatctatgtggatgaggttctctggcgagctcaggttcatccagctagaccttcccagactt 
tgacagcagaagaagcgactgccattcatgaccagaccattgctgttttgggccaggctgttgaaaaagct 
cc^ccattcggacttataccaatgcctttggggaagatggaagcatgcaggactttcatcaggtcta^^ 
ctggtcaagaatgtgtacgctgtggtaccatcattgagaaaattc^ct 
aaactgtcaaaggagggactga 

mllvftxgglmpelpevetvcrgleklngioiissieirypkmiktdleefqrelpsqliesm 
rmegkyfyypix^gperkhahvffhfedggtlvyedvrkfotmellvpdlldvyfiskklgpepseo 

KSKKPIKSHLLD0TLVAGU3NIYVDEVLWRAQVHPARPSQTLTA£EATAIHDQT1\VLGQ 
GSMQDFHQVYDKTGQECVRCGTIIEKIQLGGRGTHFCPNCQRRD2 

ID34 633bp 

ttgtccaaact gtcaa aggagggactgat ggga aaaatcatcggaatcactgggggaattgcctctggtaagtca 
actgtgacaaattt tcta agacagcaaggctttcaagtagtggatgccgacgcagtcgtccaccaactacagaaa 

CCTGCTGGTCGTCTCnTTGAGGCn^CTACAGCA 

gccctctcctagctagt ctcat cttttcaaatcctgatgaacgaoaatggtctaagcaaattcaaggggagattat 
ccgtgaggaactggctactttga gaga acagttggctcagacagaagaga a 1 1 111 1 catgg atattcccctactt 
tttgagcaggactacagcgattggtttgctgagacitggttggtctatgtggaccgagatgcccaagtggaacgc 
ttaatgaaaagggaccagttgtccaaagatgaagctgagtctcgtctggcagcccagtggcctttagaaaaaaag 
aaagatttggccagccaggttcttgataataatggcaatc^gaaccagcttcttaatcaagtgcatatcct^ 
agggaggtaggcaagatgacagagattaa 

MSKl^EGLMGKnGrTGGlASGKSTVTNFLRQWFQVVD^^ 

ASLIFSNPDEREWSKQI(^EUR£ELATLREQI^QTEEIFFMDIPIXFEQDYSDWFAETWLVYVDRDAOVER1J4KRDO^ 
KDEAESRLAAQWPLEKKKDLASQVLDNNGNQNQLLNQVHILLEGGRQDDRDZ 

ID35

TTGATAATAATGGCAATCAGAACCAGCTTCTTAATCAAGTGCATATCCrrCTTGAGGGAGGTAGGCAAGATGA 
GA GATTAA CTGGAAGGATAATCTGCGCATTGCCTGCrTTTGGTAATTTTCTGACAGGAGCCAGTA l 1 1 LUTl 'GGTTG 

TACii i i l atgcccatcttcgtggaaa atcta ggtgtagggagtcagcaagtcgctttttatgcaggcttagcaat 

TTCTGTCTCTGCTATTTCCGCGGCGCTCTTTTCTCCTATTTGGGGTA 

TGATGATrCGGGCAGGTCTTGCTATGACTATCACTATGGGAGGCTTGGCCTTTGTCCCA^ 

ill 1 HI CGTTTACTAAACGGTGTATTTGCAGGTTTTGTTCCTAATGCAACGGCACTGATAGCCAGTCAGGTTCCA 
AAGGAGAAATCAGGCTCTGCCTTAGGTACTTTCTCTACAGGCGTACTTGCAGGTACTCT

GTGGCTTTAT CGCA G AATTATTTGG CATT CGTACAG '1 1 1'1'LM'rACTGGTTGGTAGTTTTCTATTTTTAGCTGCTATTT 
TGACTATTTGCTTTATCAAGGAAGATXITCAACCAGTAGCCAAGGAAAAGGCTATTCCAACAAAGGA^ 
CTCGGTTAAATATCCCTATCTTTTGCTCAATCTCTTTTTAACCAGTTTTGTCATCCAATTTTCAGCTCAATCGATTG
GCCCTATTTTGGCrCTTTATGTACGCGACTTAGGGCAGACAGAGAAT Ll 1C1 1 l ' l ' l GTCTCTGGTTTGATTGTGTCC 
AGTATGGGCTTTTCCAGCATGATGAGTGCAGGAGTCATGGGCAAGCTAGGTGACAAGGTGGGCAATCATCGTCTC 
TTGGTTGTCGCCCAGTTTTATTCAGTCATCATCTATCTCCTCTGTGCCAATGCCTCTAGCCCCCTTCAACTAGGACT 
CTATCGTTTCCTCTTTGGATTGGGAACCGGTGCCTTGATTCCCGGGGTTAATGCCCTACTCAGCAAAATGACTCC^ 
AAAGCCGGCATTTCGAGGGTCTTTGCCTTCAATCAGGTATTCTTTTATCTGGGAGGTGTTGTTGGTCCCATC 
GTTCTGCAGTAGCAGGTCAATrrGGCTACCATGCTCrrCTTTTATGCGACAAGCCrTTGTGT^ 
TTTAACCTGATTCAATTTCGAACATTATTAAAAGTAAAGGAAATCTAG

MIIMAIRTSFLIKCISFLREVGKMTEIKWKDNLRIAWFGNFLTGASISLVVPFMPIFVENLGVGSQQV
AALFSPIWGIl^DKYGRKPMMlRAGl^MTITMGGLAFVPNIYWLIFL 
TI^GVVAGTLTGPFIGGFIAELFGIRTVFIXVGSFLFLAAILTICT^ 
FVIQFSAQSIGPILALYVRDLGQTENIXFVSGLIVSSMGFSSMMSAGVMGKLGDKVGN^ 

55 a ^£^j^^^^ 

ID36 1311bp 



ATGGCCCTACCAACTATTGCCATTGTAGGACGTCCCAATGTTGGGAAATCAACCCTATTTAATCGGATCGCTGGTG 

AGC £*^ CTCCA ^ GTAG ^ GATCTC ^ 

GTTCTTrrAGCATGATTGATACAGGAGGAATTGATGATGTCGATGCTCCTTTCATGGAACAAATCAAGCACCAGGC 
AGAAATTGCCATGGAAGAAGCAGATGTTATCGTTTTTGTCGTGTCTGGTAAGGAAGGAATTACTGATGCAGACGA 
ATACGTAGCTCGTAAGCTTTATAAGACCCACAAACCAGTTATCCTCGCAGTCAACAAGGTGGACAACCCTGAGAT 
GAGAAATGATATATATGATTTCTATGCTCTCCGTrrGGGTGAACCATTGCCTATCTCATCTGT^ 

ACAGGGGATGTGCTAGATGCGATCOTAGAAAATCTTCCAAATGAATATGAGGAAGAAAATCCAGATGTCATTAAG 
TTTAGCTTGATTGGTCGTCCTAACGTTGGAAAATCAAGCTTC^ 
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CTACTCCTGTTGCTGGAACAACTCCTGATGCCATTGATA^ 

g^ttca^cggctgotatgcgtaactctggtaaggttt 

CGTCCTA^GACCGTrCAGATGTGGTCTTGATGGTCATCAATGCGGA 
ATCGCAGGATTTGCCCATGAAGCTGGTAAAGGGATGA^ 

aaccacactatgaaaaactgggaagaagatatccgtgagcagttccaatacctgcct^ 

CTATCAGCrrrAACCAAGCAACGTCTCCACAAACTrCCTGAGATGATTAAGCAAATCAGCGAA^ 
CGTATTCCATCAGCTGTCTTGAACGATC^^ 

AAACOTCTCAAGATTrrCTATGCGACCCAAGTGGCAACCAAACCACCAACCnTCT 

AACTCATGCA C 1 U I CI I 'ACCTGC Ol nCl TGGAAAATCAAATCCGCAAGGCCTTTGi i i i i GAGGG AACACCGAT 
TCATCTCATCGCAAGAAAACGCAAATAA 

MALPTIAIVGRPNVGKSTLFNRJAGERISIVEDVEGVTRI^RIYATGEWL 

EEADVrVFVVSGKEGrTDADEYVARKLYKTHKPVILAVNKVDNPEMRNDIYDFYALGLGEPLPISSVHGIGTGDVLDAl 
VENLPNEYEEENPDVIKFSLIGRPNVGKSSLINAILGEDRVUSPVAGTTRDAIDTHFTDTDGQEFTMIDTAGMRKSGKV 

YEOTEKYSVMRAMRAIDRSDV\^MV1NAEEGIR£YDKRJAGFAHEAGKGM 

FQYLPYAPHFVSALTKQRLHKLPEMIKQISESQhnTUPSAVLNDVIMDAlAINPTPTM 

NEEELMHFSYLRFLENQIRKAFVFEGTPIHLIARKRKZ 

ID37 714bp

ATGACAGAAACCATTAAATTGATGAAGGCTCATACTTCAGTGCGCAG 

r: ac-ft a a aTH AH aTTTTGACAGCAGCCCAGATGGCATCATCTTGGAAGAATTTCCAATCCTACTCTGT GATTGT GG 
TACGAAGTCAAGAGAAGAAAGATGCCTTGTATGAATTGGTACCTCAAGAAGCCATTCGCCAGTCTGCTGTTTTCCT 
TCTCTTTGTCGGAGATTTGAACCGAGCAGAAAAGGGAGCCCGACTTCATACCGACACCTTCCAACCCCAAGGTGT 

ggaaggtctcttgattagttcggtcgatgcagctcttgctggac 

tatggtggtgtgattatcggtttggttcgatacaagtctgaagaagtggcagagctctttaacctacctgactaca 

cctattctgtctttgggatggcactgggtgtgccaaatcaacatcatgatatgaaaccgagactgccactagaga 

atgttgtctttgaggaagaataccaagaacagtcaactgaggcaatccaagcttatgaccgtgttcaggctgact 

atgctggggcgcgtgcgaccacaagctggagtcagcgcctagcagaacagtttggtcaagctgaaccaagctcaa 

ctagaaaaaatcttgaacagaagaaattattgtag 

MTETIKLMKAHTSVRRFKEQEIPQVDLNEILTAAQMASSWKNFQSYSVTVVRSQEKKDALYELVPQEAIRQSAVFLLFV
GDLNRAEKGARLHTDTFQPQGVEGLLISSVDAALAGQNALLAAESLGYGGVIIGLVRYKSEEVAELFNLPDYTYSVFG
MALGVPNQHHDMKPRLPLENVVFEEEYQEQSTEAIQAYDRVQADYAGARATTSWSQRLAEQFGQAEPSSTRKNLEQK 

KLLZ 
ID38 729bp 

ATGACAGAAATTAGACTAGAGCACGTCAGTTATGCCTATGGTCAGGAGAGGATTTTAGAGGATATCAACCTACAG 

GTGACTTCAGGCGAAGTGGTTTCCATCCTAGGCCCAAGTGGTGTTGGAAAGACCACCCTCTTTAATCTAATCGCTG 

GGATTTTAGAAGTTCAGTCAGGGAGAATTGTCCTTGATGGTGAAGAAAATCCCAAGGGGCGCGTGAGTTATATGT 

TGCAAAAGGATCTGCTCTTGGAGCACAAGACGGTGCTTGGAAATATCATTCTGCCCCTCTTGATTCAAAAGGTGG 

ATAAGGCAGAAGCTATTTCCCGAGCGGATAAAATTCTTGCGACCTTCCAGCTGACAGCTGTAAGAGACAACT 

CTCATGAACTTAGCGGTGGGATGCGCCAGCGTGTAGCCTTACTCCGGACCTACCTTTT^ 

CTTAGATGAGGCCTTTAGCGCCTTGGATGAGATGACAAAGATGGAACTCCACGCTTGGTATCTTGAGATTCACA^ 
GCAGTTGCAGCTAACAACCCTGATCATCACGCATAGTATTGAGGAGGCCCTCAATCTCAGCGACCGTATCTATATC
TTGAAAAATCGCCCTGGGCAGATTGTTTCAGAAATTAAACTAGATTGGTCTGAAGATGAGGACAAGGAAGTCCAA 
AAGATTGCCTACAAACGTCAAATTTTGGCGGAATTAGGCTTAGATAAGTAG 

MTEIRLEHVSYAYGQERILEDINLQVTSGEVVSILGPSGVGKTTLFNLIAGILEVQSGRIVLDGEENPKGRVSYMLQKDLL
LEHKTVLGNIILPLLIQKVDKAEAISRADKILATFQLTAVRDKYPHELSGGMRQRVALLRTYLFGHKLFLLDEAFSALDE 
MTKMELHAWYLEIHKQLQLTTLIITHSIEEALNLSDRIYILKNRPGQIVSEIKLDWSEDEDKEVQKIAYKRQILAELGLDK

Z 

ID39 2433bp

ATGAACTATTCAAAAGCATTGAATGAATGTATCGAAAGTGCCTACATGGTTGCT 

TAnAGTCGTGGCACTTGTTGATTGCCATGTCTAATCACAGTTATAGTGTAGCAGGGGCAACTTTAAATGA TTAT CC 
GTATGAGATGGACCGTTTAGAAGAGGTGGCTTTGGAACTGACTGAAACGGACTATAGCCAGGATGAAACCTTTAC 
GGAATTGCCGTTCTCCCGTCGTTTGCAGGTTCTTTTTGATGAAGCAGAGTATGTAGCGTCAGTGGTCCATGCTAAG 
GTACTAGGGACAGAGCACGTCCrCTATGCGATTTTGCATGATAGCAATGCCTTGGCGACTCGTATCTTGGAGAGG 
GCTG U1 1 1 1 111 1 ATGAAGACAAGAAAGATCAGGTCAAGATTGCTGCTCTTCGTCGAAATTTAGAAGAACGGGCA 
GGCTGGACTCGTGAAGATCTCAAGGCTTTACGCCAACGCCATCGTACAGTAGCTGACAAGCAAAATTCTATGGCC 
AATATGATGGGCATGCCGCAGACTCCTAGTGGTGGTCTCGAGGATTATACGCATGATTTGACAGAGCAAGCGCGT 
TCTGGCAAGTTAGAACCAGTCATCGGTCGGGACAAGGAAATCTCACGTATGATTCAAATCTTGAGCCGGAAGACT 
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ID41 762hp 
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•rT-T^raTrv^CGTCTGAGGGTCAOT 



KYV1KWKRS2 



rN L «IIJIRHISUjGFLGV15IWQLA^^ 

_MDSLT 
ILWHFK 
i/IKWKR 

ID42 372bp

MIFNPlCCMIREKKGDRDMAFTNTHMRSASFGINrrSLPDDIIDSFWYIIDHFLKNVFELEEELEFQLLNNOGKITFHFSSQ 
HLPTAIDFDFNHPFDPRYPPRVLVLDMDGRET1LLPEENDLFZ 

ID43 1569bp

Ar-ArrrrTfiTCATTCTATCTArmAAGAAAAGTAATAATCA^ 

T^AATGCAA^ACTAATGATrrAGATAATTCACCAACTGTrAATCAGAATCC^ 

T^C^CCM^TC^AGATAATTC^ 

rACAATTAGACAACAGAACAG^GAATCTACACTAACATC^ 

TaAGTG^CAGAATTATGAAAAAAGAA^^ 

goattca^gatgat^gacaagcaattcaagatgcaata 

ATTTrOCTG^AAGGAACTTATTTAGTAAAAGAAATTGTTTTTT^ 
AfVTV^AA^CTAAATGGTATAAATATTAAGAATCACCCTTCCATTGTTTTT 

t^aca^gTcga^cagaa^ctctaatatta^ 

GjWCAACCAAATATTGAATTATTACGAGTTAGTGaTAATTTaGTAGTCTCAGAGAATAGT 

VV1AENI1FNIADPKTIOURVAKDS 
ID44 324bp

GTGATGAAAGAAACTCAGCTATrAAAAGGTGTTCTTpAAGGT^ 



STtggTTaTCAGTTGGTTCAGACTTTGCGAGAGGCTGGATTTGATaCTATCGTTCCAGGAACTA 

gcaa^gt^ag^^ 
agggattaagaatgggggttaa 
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ID45 filfihj 




^^CTTGTTCCGTGGGTTT^ 
ID46 348bp

ID47 12fif»h p 
CTGGCACGAAAAATG^AGA^^ 

ID48 7Q<Thp 




GTTTTGTCTTTCAGCAGTTCTTTCTTCTATCGA 



CTCAGTTAGGAAAGGAGGAAAACTAA 

l^s^lade^gaT^gnqimqu-vdlnkegktii^ 

ID49 1200bp

^GTACAOTA^CAGAAGCGCAGGCGGCCtATGATTCAGCTACTCGAGCAGTAGCTAGGGCAGATCCT 
COTGATG^GCCTG?A^TG^C^C^ 

aSctIgagggaact^ 

catattgtcaggC^g^vaaatttacaagtcaagggagaattgt 
aag^g^aagc^a^ctaaagtgtatcctgataaaaaa 

TAXA^CAATG^AAGCAGCTAGTCCAGCAGCCGGGAATAATACAGGTTCTAAATACCCrrAT^ 

A^^r^nmGCTGATTTGAAAC^ 

§£cCTG C £S 

AAAGTTGAGGTTTCATTGGGAAATGCTGACGCAGAAAATCAAGAAATCACTTCTG^TTTAA 

qtcatcagtaatccaacatcttccttggaacaagcaaaagaggtgaaggctgatgaagcaactaattac 



MKKKNGKAKKWQLYAAIGAASVWLGAGG1LLFRQPSQTA1XDEPTHLWAKEGSVASSV 

£oWGE^EYNLA^ 

G^VNIEVKS^^ 

VKADEATNZ 

ID50 759bp

atgtcacgtaaaccatttatcgctggtaactggaaaatgaacaaaaatccagaagaagctaaagcattcg^^gaa 

GCAG^GclT^AAOTCCTTCATCAGATCTTGTrGAAGCAGGTATCGCTGC^A^ 

t^cttgctgttgcaaaaggctcaaaccttaaagttgctgctcaaaactgctactt 

TGGTGAAACTAGCCCACAAGTTTTGAAAGAAATCGGTACTGACTACGTTGTTATCGGTCACTC 

cta^c^gXaact^ 

^l^^cTGAATCAOTGAAACTrACGAAGCTGGTAAAGCTGCTGAA 
^G^GGA^CTGCTC^AAG^GCTGCCTCACnTATCGCrrATGAGCCAATCT 

rATcScIcAA^GA^^^^ 

AAGTCGCAGACAAAGTTCGTGTTCAATACGGTGGTTCTCrrrAAACCTGAAAATGTTGC^ 

^qaCGTTGACGGTGCCCITGTAGGTGGTGCGTCACTTGAAGCT 

TAA 

e^a^viXy¥piwaict 
vggasleaesflalldfvkz 

ID51 1473bp

TTGAAAACAAAAATTGGATTAGCAAGTATCTGTTTACTAGGCTTGGCAACTAGTCATGTCGCTGCAAATGAAACTG 

aIgtagcaa^cttcgc^gatacaacgacagcttcaagtagttcagagcaa 
aa^cgagcgc^aTg^acagactaatgctgctgcccactgggatgggga^tt 
aagctcaaagtgaatggatttttgacaactactataaggcttggttttatattaattcagat^ 
gaatca^tcgcatcgaaattactacctgaaatcaggtggatatatggcccaaaacgagtggatc^^^ 




CGAATGCTTTAGAATGGGAAAACCCTcY^^ 
AAGGAATCCAAGGGCGCGTAGATGTCAGCGT^GCTA^ 

ID52 774bp

TGC^^S C G^ 

GGGACACTCATGCTGATTTTGGCTCA^ACATTTTTCTrGn^ 
CT-mGCTGCCATrTACATCTACCAGTC^ 

GGTTGCGCCTGACCrrTATGATTGGTGCTAGCrff-^^ 

CCGTTCTATCTACrrCACGTGGOTC^^^ 

IGGCGACATGATTCATGCGGCCTATGACT^ 

g c c^ g ^^^ 

ID59 107^ hp 

^A^T^^^^ 

GACTCAGTrrACAGAAGAAACAGGAAT^CA^ 
^I^ GCAGCCT ^AACGACCTACGATA^CC^^CCAA^r 




I^^ ACCTaGaag ^ ATGag ^^gaccaV^ 

AAATGTATCGGAAGTAG ^ TGGACAGGGa ^TATAGCGAOT 

KPEYKNSIMLFDGAREVLGUJLNSLGYSLNS^?^ 
QMLEKNENLRYVVFTEASNL^NMV^ 

kafypdvetmkhlevyekfdhkwtgkysd^lSf^ 
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55 



ID61 1551bp

ATf" a aTaaaaaaCTAACAGATTATGTGATTGATCTGCTGGAAATTTTAAATAAACAACAAAACCAGo i li ictgg 

rGA^Im^TAmclGTATGGTGGT^ 
ACCTGTl^ACTACATTATCTATACGAGTTTGGCCTTCCTG^ 
rr^?A^AGTCG^ACAGCAAGATTACGGATTTCATGAAAATCTTrT^ 
ATATAG^^TCTGTTATG^CCrrCTTGCCACTCTrCTCCATCCGTT^ 

TCG^ACcSCTTGATTGffrGCCGCTGATGGTGGGGCTCITTTTATGCATAGTTACCA^ 

gaactggtcggtattxtggataaggattctaagaaaaagggtcaaaaacttggtggtat^ 
JtcaSattoctoa^agccaaacgccatcaaatcga 

ag^atatgagcgtatcttgcagatgtgtaataagctgggtgtcaaatgttacaagatgcct 

TGTTC^G<GGOCn^ACC7WVGCAGGTACTGGCTrCCA 

CGTC^GACGAATCGCGTCTGGGTGCAGAACTGACAGGTAAGACCATCTTAGTCACAGGAGCTGGA^ 
GGTTCTGAAATCTGTCGTCAAGTTAGTCGCTTCAATCCTGAACGCATTOT 

cStgatcgt^gttgc/^gtctttg^ 
c^ItgItggTg^ca^ 

CATCAAGCr^AOTGTCTAAGATGGTrATGATTT« 
AC^ScraSGAGTTGA 

ggaatgttcttggtagccgtggtagtgtcattccagtctttgaacgtcagattgctgaaggt 

GACAGACTTCCGTATGACCCGTTACTTTATGACCATTCCAGAAGCTAGCCG^

gcca^agatgg^aaVt^^ 

C^CTA^GTGG^ 

gaaictcttggtatcaaccgaactcgttgataatcaagttatggataagattttcgttggtaaggtta^ 

ctf^gaatccatcaatcaaaagattggagagttccgcactctcagtggagatgagttga^ 

cctttgctaatcaaacaacccacattgaataa 

MNKKLTDYVUiLVEILNKQQKQVFWGiroiFSMVVSnVSYILFYGUN^^ 
SfMKIFF^ 

^RYFmiP^SRLVIHAGAYAKIXJEVFILDMGKPVKIYDLAKKMVL^GHTESEIPIVEVGIRPG 
DNQVMDKIFVGKVNVMPLESINOKtGEFRTLSGDELKQAHAFANQTTHIEZ 

ID101 1338bp 

,— r . a( -rttatgatagttacagtcaag aaagtcg ag attt acatgaaagtctagtcgctactggtctttctc 
aIc^gag^tcat^^^ 

rA^TGGAAAACCTCTCT 

c^ota^gaX^^^ 

G^AGACTGCA^ 

aacgac^atagcg^cagatagcgagccgattatgacagtttaccaagatgtcaatggtcaacaagt^ 

aaaccatgtgacgggtgatatcttattgactttgccaggtcagtccatgcgttactttg 

atcacc^ct^gcaagatttggaaatagataccagtcagcttatctttaat^ 

^C^raATCATCCAGATAAATCT^ 
AAT^CAG^GArrrrGGAAAGTGATAATCT^ 

CGCGCmA^GAGTrAACTGACGAGAAATACCATGATCAGTTrGTGCACTTGGGTTATCA^ACCA^ 

ataATTTCCTAAGACGAGATGCCTTAATCTTGACCAATTCAGATCAGATTGAGCAAGTAGAAGC^ 

rrxTGCCTGATGTCACTTTCCGTATTGCAGCGGTGACAGAGATGTCTTCTAAGCTCTTAGACA 

aaTgtggccc^accagaacgctagtccacagaaga^ 

^AACCACAGTAATGAGTTGCTACAGGCAGTGCGTCAGGCCTITGAGCACAATCTCTrGAT^ 
GaCGGTGCACAATAGACTTTATATCGCTCCAGACCATCTATTTGAAAGTAGTGAAGT^ 

^yp^^^XTGGCCCTnCAGATGTTGATCAAATGCGTCAGGCACTTGGCAAACAAGGCCAACATGCAAATTATGTTG 

acttggtgagatatcaggaaaccatgcaaactgttttaggaggctaa 
e^s^lZlcypI^a^^^^ 

SSEVAALVET1KLALSDVDQMRQALGKQGQHANYVDLVRYQETMQTVLGGZ 
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^^^^^ 



1DI03 229?Kp 

cTGCTATr A S GAG SS2^ CGAAAGT CAGCGGATC^^ 
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gaaatcgtgactcattttccataa 

M«!_SnOELVAKTVEFRQRl^EGESLDDILVEAFAWR£ADKRJLGMFPYDVQVMGAlVMHYG 
SyLNDNI^NEEGKFLRPF^^ 

QSASQKNPIVEYYQEAYAGFEAMKEQIHADMVRNLLMGLVEVTPKGEIVTHFPZ 
ID104 879bp 

atgaaacaagaatggtttgaaagtaatgattttgtaaaaacaacaagcaagaacaagcctgaagagcaag 
agaggttgcagacaaggctgaagaaaggatacccgatctcgatacaccaattgaaaaaaatactca 

aGGAA^TCTCTCAAGCTGAAGTC^AATTGGAAAGCCAGCA^ 

g1£aG^cI?aaa^ 

AAAAGTCACTA^AdCTGAAGAGAGCCAAGAAGCT^ 

n^AAATrmA^GA^AAGTCCrTATATCCCCGACCAAGCrCCAAAATCTAGGGATAAATGGA^ 

IcAGCCTITCT^GCTCATrCTGTTrrCTGCATCTrC Ll 1 11 IL1 1 1 AGTATCTATCA^C A^CATGCTACTAT 
nr.A^ATATAGC^G^CA^AACACrrCGCTTCCCTGAGCAGCTAGCrCCm 

AGTAGCGAC^c^cTC^cVrcrrrrcATrccTcrr^ 
gactggacgctaga^aIg^ctccaacaatatactcaact^^ 
TTTCTTTG Cri ten i gatagcctacgatttacagccctcttgtgtgtga 

MKOEWFESNDFVKTTSKhKPEEQAQEVADKAEERIPDLDTPIEKOTQLEEEVSQAEV^LESQQEEKIEAPEDSEA^ 

eSs^eeepd^etekvtiaeesoealpqqkattkepixiskslespyipdqapksrdk^e 

QEKDWTLDKVLQQYSQLLAIPISSLLLLVSLLSL1AYDLQPSCV2 
ID106 327bp

ATGTACTrrCCAACATCCTCTGCCTTGATTGAATTTCTCATCTTGGCTGTACTGGAGCAGGGTGATTCTT 

tgag!?^cca^cca^^ 

IggISIgS^G^ 

TGGTA^AGAGCAGCTCTTGACCCTAAAAGATGAATGGGCACTCTATACAGACACCATCAATGGCATCATAGAAGG 

gagtatccgccatgacaagaactga 

myfptssaliefulavleqgdsygyeisqtiklianikestlypilkklegnsflttysrefqgrmrkyysltnggieqllt 
lkdewalytdtinghegsjrhdknz 

ID108 954bp

ATGGATTTTGAAAAAATTGAACAAGCTTATATCTATTTACTAGAGAATGTCCAAGTCATCCAAA^ 
CCAAC^TTTATGACGCCTTGGTGGAGCAAAATAGCATCTATCTGGATGGTGAAACTGAGCTAAA 

ACA^CAATCA^GCC^ 

aggctgggcaaacagaacccttgcaggccaatcaccagtttacaccggatgctattgctt^ 

TGTGGAAGAGTTGTTTAAAGAGGAGGAAATTACTATCCTCGAAATGGGTTCT 
m^GA^CTCG^ 

agc^ggcagaVgtaattggtttgcaggct^ 

AGCGATGTGGTCATCAGTGACTTGCCTGTCGGCT 

ctagccaagaIcatacttacgcccatcacttgctcatggaac 
ctatttttctagctccgagtgatttgttgaccagtcctcaaagtgatttgttaaaagaat 

GAGTCTGG^GCTATGArrAGTCTGCCT^ 

ATTTAAAGAAAATTrTCAAAAATGGACTCAAGGTACTGAAATATAA 

MDFEKIEOAYIYLLEhA'OVIQSDl^TNFV'DALVEQNSIYLDGETELNQVKDNNQALKIUJVLRKEEWUCTYQFIih^ 
G^LQANHO^FTPDAIALLLV 
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QLYGsegayasiltmgaigsiigallaskikaniynlijllaltgvgvfmm 
qtkvt^eflgr\t-st7ftlailfmpiakgfmtvlpsvhl^ 

ID113 2466bp 

ATGCAAAATCAATTAAATGAATTAAAACGAAAAATGCTGGAATTTTTCCAGCAAAAACA^ 

GCTAGACCTGGCAAGAAAGGTTCAAGTACCAAAAAATCTAAAACCTTAGATAAGTCAGCCATTTTCCCAGCT
TTACTGAGTATAAAAGCCTTATTTAACTTACTCTTTGTACTC^ 

CTTTGGGATACGGAGTGGCCTTATTTGACAAGGTTCGGGTGCCTCAGACAGAAGAATTGGTGAATCAGGTCAAGG 

:^^ ct H tctat ^ CAGAGattac ^ 

TTCTATCTCATCTGAGCAAATTTCGGAAAATCTGAAGAAGGCTATCATTGCGACAGAAGATGAACACTTTAAAGA 
ACATAAGGGTGTAGTACCCAAGGCGGTGATTCGTGCGACCTTGGGGAAATTTGTAGGTTTGGGTTCCTCT 
GGTTCAACCTTGACCCAGCAACTAATTAAACAGCAGGTGGTTGGGGATGCGCCGACCTTGGCTCGTAAGGCGGCA 
1« GAG ^ GTGGATGCTOT GCCTTGGAACGCGCCATGAATAAAGATGAGAT^ 

SE^ GGCCGAAATAATAAGGGACAGAAT ^^ CA 

CCAGTCAGTTGACTGTTCCTCAAGCAGCATTTTTAGCAGGACTTCCACAGAGTCCCATTACTTACTCTCCTT 

AAATACTGGGGAGTTGAAGAGTGATGAAGACCTAGAAATTGGCTTAAGACGGGCTAAGGCAGTTCTTTACAGTAT 

GTATCGTACAGGTGCATTAAGCAAj^GACGAGTATTCTCAGTACAAGGATTATGACCTTAAACAGGACTTTTTACC 
nr\ ATGGGGCaCGG ™ CaGGaa ^CACGAGACTA 

GACTATCTAGCTCAGAGAGACAATGTCTCCGCTAAGGAGTTGAAAAATGAGGCAACTCAGAAGTTTTATCGAGAT

TTGGCAGCCAAGGAAATTGAAA-ATGCTGGTTATAAGATTACTACTACCATAGATCAGAAAATTCATTCTGCCATG 
CAAAGTGCGGTTGCTGATTATGGCTATCTT^ 

GATAACCAAACAGGTGCTATTCTAGGCTTTGTAGGTGGTCGTAATTATCAAGAAAATCAAAATAATCATGCCTTTG
ATACCAAACGTTCGCCAGCTTCTACTACCAAGCCCTTGCTGGCCTACGGTATTGCTATTGACCAGGGCTTGATGGG 
AAGTGAAACGATTCTATCTAACTATCCAACAAACTTTGCTAATGGCAATCCGATTATGTATGCTAATAGCAAGGG

^CAGGAATGATGACCTTGGGAGAAGCTCTGAACTATTCATGGAATATCCCTGCTTACTGGACCT^ 

^I GAAAAGG £?^ 

TTGCCAATGGCTGGTGGTATTGAAGTC^ 

GTTTATCATCAGAAGCATGTGATTTCAAAGATTGAAGCAGCAGATGGTAGAGTGGTGTATGACTATCAGGATAAA 
CCGGTTCAAGTCTATTCAAAAGCTACTGC^CGATTATGCAGGGATTGCTACGAGAAGTTCTATCCTCTCGTGTGA 
CAACAACCrrCAAGTCTAACCTGACTTCTTTAAATCCTACTCTGGCT 

AACCAACCAAGACGAAAATATGTGGCTCATGCTTTCGACACCTAGATTAACCCTAGGTGGCTGGATTGGGCATGA 
TGATAATCATTCATTGTCACGTAGAGCAGGTTATTCTAATAACTCTAATTACATGGCrCATCTGGTAAATGCGATT 
CAGCAAGCTTCCCCAAGCATTrGGGGGAACGAGCGCTTTGCTTTAGATCCTAGTGTAGTGAAATCGG 
AAATCAACAGGTCAAAAACCAGAGAAGGTTTCTGTTGAAGGAAAAGAA

AGCTATTGGGCTAATAAGTCAGGAGCGCCAGCGACAAGTTATCGCTTTGCTATTGGCGGAAGTGATGCGGATTAT 

CAGAATGCTTGGTCTAGTATTCTGGGGACrrCTACCAACTCCATCCAGCTCCAGCAGTTCAAGTA 

ATAGCAGTAACTCAAGTACTACACGACCTTCTTCTTCAAGGGCGAGACGATAA 

MQNQLNELKRKMLEFFQQKQKNKKSARPGKKGSSTKKSKT
CVALFDKVRVPQTEELVNQVKDISSISErTYSTC^ 
ATLGKFVCLGSSSGGSTLTQQLIKQQVVGDAPTl^RK^ 

ROAAEGIFGVDASQLTVPQAAFI^GLPOSPrTYSPYEOTGELKSDEDLEIGLRRAKAVLYSMYRTGALSKDEYS^ 
DLKQDFLPSGTVTGBroYLYFTTLAEAQE^ 

^ A ^2f A Y^ 

LMGSETILSNYPTNFANGNPIMYANSKGTGMMTLGEALNYSWNIPAYWTYRMLREKGVDVKGYMEKMGYEIPEYGIE 

S^MGGGIEVTVAQHTNGYQTLANNGVYHQKHVISKIEAADGRVVYEYQDKPVQVYSKATATIMQ^ 

T!^"> TSLN ^^ NADWIGCT ^ 

50 lc S ™£l^ 

ID114 1974bp

S S ATG^^TTrTATGTAAGTCC^TTTITCCTATrCTACTAGGATTG ATTGCGTTTGGAGTCTrATCCACnTCAT 

55 TAt I I 1 loll AATAATAATCTGTTGACGGTTTTAATnTG 1 1 1L1 1 U I GTAGGAGGCTATGnTTTTTATTTAAGAA 

A ?££ A ™ AA C AG A TC< ^TAGGTC^^ 

*n T1 GG ^J CCGTCTAC ^ ATGCCAAGC ^GGTGAGAAGCGTTATGCTGTTCAT^^ • 

0U G J A IIII G 7 AGA I G 7 ATCCAGGGA ACAAGCCATAACAGATGAATTGGT 
^ C ^ G 5 A I AA r rA J^ GA J" GGAGGATGAAA 

AATTTTATATCAGAGTTTTCAGAAAAACACATGATGTITTCTCGTCGGGTAAGTATGGATCGATTTTATCTATTT^C 
TGCTCAATTTGAACTTGGCTGAAGTACGTGGTGGCGACCAGGTGGTTGTTAAGGAAAACGACGAAACGAAAAATC






CAGTTTATTTTGGTGCTGGGTCTGCTC 

TTCAGATAAGATTCGGAGTGTAGATCAGGTTTTTGTAGTCGGTCACAAAAATTTAGACATGGATGCTTTGGGCT 
GCTGTAGCTATGCAGTTGTrCGCCAGCAATCrrGArrGAAAATAGCTATGCTCm 
CAGATATTGAACGAGCTGT^ATTCATAGAAAAAGAAGGAGTTACGAAGTTGTTGTCTGTTAAGG 
GGATGGTGACCAATCoi iLinui i GATTCTTOTAGACCATTCAAAGACAGCCTTAACATTATCAAAAGAATTTTA 

tgatttatttacccaaaccattgttattgaccaccatagaagggatcaggattttccagataatgcggttattact 
tatatcgaaactggtgcaagtagtgccagtgactttggtaacgg 

CTAGTC GGAC ATTTGATGTTGCTAGCTATCTCAGAACGCGCGGAAGTGATAGTATTGCTATrrAr.nAA ATrnrrnr 

GACAGATTTTGAAGAATATCGTGAGGTCAATGAACTTATTTTACAGGGGCGTAAATTAGGTTCAGATGTACTAATA 
GCAGAGGCTAAGGACATGAAATGCTATGATACACTrrGTTArrAOT 

GGTATTGAAGCGAO i i nunti i GCGA^GAATACACAAGGATTTATCTCTATCTCAGCTCGAAGTCGTAGTAAAC 
TGAATGTACAACGGATTATGGAAGAGTTAGGCGGTGGAGGCCACTTTAATTTGGCAGCAGCTCAAATTAAAGATG 

T^CCTTGTCAGAAGCAGGTGAAAAACTGACAGAAATTGTATTAAATGAAATGAAGGAAAAGGAGAAAGAAGAA 
TGA 

MKKFYVSPIFPILVGLWFGVl^TFin^NNNLiTVULFLFVGGYVFLHCKLRVHYTRSDVEOIOYVNHOAEESLTALLE 

QMPVGVMKLNI^SGEVEWFNPYAELILTXEDGDFDLEAVQTIIKASVGNPSTYAKLGEKRYAVHMD^ 

^ A ™ E ^^ IG ' VSVDNYDDLEDCTSKD ^ 

DKFSVIDAFREESKQRQLPLTI^MGFSYGDGrmDEIGKVALLNLNLAEVRGGDQWVKErTOETKNPVTFGGGSAASK 
RTRTRTRAM^^AISDKI^«VDQVFVVGHKNLDMDALGSAVGMQLFAS^fV^ENSYALYDEEQMSPD!ERAVSFlEKEGV 
TXlJ^VTCDAMGMVTNRSLLILVDHSKTALTliKEFYDLFTQTIVIDHHRlUJODFPDNAVrrYIESGASS 
NSKKNR^RMQASVLMAGMMLDTKNFTSRVTSRTFDVASYLRTRGSDSIAIQElAATDFEEYRfV^OG 

ID115 663bp 

A T£ AAGTG £II G ™ TCTGGGC ^^ 

A E T 5Z G Z?^ G ^ CAGACTOTGA " CTACT 

AGA^GTCAACAAAGTCrrCAAGATT^AACTTTGGTCTAAAGAGGGAGrTGAAGTCAGTCATA 
TA ^ AC J AA I CAAGCTATGAAGGAlrr ^^ 

gcttcatttttaagtgaggagttgaaaaagtacaaagagtatcaatttgttgtaattcccctaa 
aaagagaagagagagccagttcttctaaaaatcgttcagagcgcttggggacagaacttc ci i ici ii attaaaa 
gaaactgttggaagaagctggtgctaaggatgtaaaaacattttcccttgtaagatga 

MKCLLCGOTMCTVLTFSSUiLRNDDSCLCSDCDSTFERlGEENCPNCMKTEI^TKCQDCQLWCKEGVEVSHRAIFTY 
ASSSKNRSERLGTELPFFIKSGVTIPKIClXIDDlYTTGATIbniVKKLLEEAGAKDVKTFSLVRZ
ID116 1299bp 

A T° AAAG I A ^ A ^ AG 1I TATCTCGGTCG ^ A ^ A CTGA GA ATGAATTAA 

gagaaacttccagcaatgagaaaggagaaggggaaacttttctgtcaacgctgtaatagtactattctagaagaa 
tggtatttgcccatcggtgcttactattgtcgagagtgcttgctgatgaagcgagtcagaagtcatc 

ACTATrTTCCGCAGGAGGATITrCCAAAGCAAGATGTrCTCAAATGGCGCGGCC^^ 

AGATGTTTGTTTGGAGCTGTACAAGCGCCTGCAACAGGATI 1 1 iL - ri GCGGGATAGCTTTGCTACATGGAGAATCG 
GAACCrrATTTTCGAACACCACTA^GrrGCAACAACCCATCACTrTATTCAA 

GA ? AA I G ^ A T^ G ^ GAA Z^ 1 i 1 1 " 1 AACAGCGACTTCG ACCAATGAGTTAGATAAAAAGGTCCCrnTAGGAGAACTA 

A I^ C I A ?I A 3 A £ A ^ AATCG ^ GTC 

CGTTACTCA 1 i iiiGCTTCAGAAATTAAGAAAGGGGAGCAGTTAGCAGAAATCTTACAGGAGCAATTTCCAAATG 
AGAAAATTGGCTTrGTATCTTCTGTAACAGAGGAra^^ 

CAATACrTATCAGTACGACAATCTTGGAGCGCGGACrrrACCTTCCCTTGTGTGGATGTT^CCTA 




SKRSS5!!S53S!55io*« 

ID117 870bp

TrTA (~TQQCAACGCCGATTGGCAAT

ATTCCTGATrTGATTGG^CTTCA^c 




ID118 345bp

rnrTGGATGATTTTTCCCAACAA
acgaagaatgtatgttttgtgacgagttgt FNSKLRE RLGEVEADAPVKAKHVRES 

ID119 639bp

AnTGTTTTAGAGGCTCTGCTA

3GGAGCAGGCAAGACCAGTU1 i i _ 



Ac£cGAGGTTAATCGT£G^^^ 







^acctScc^aacatta^ 






ID121 28Sbp

ATGAATATATTTAGAACAAAGAATGTTACrrrrAGATAAAACAGAGATGCATAGGCATTTGAAGTTAT^ 

attttgctgggtatcggagccatggtagggacaggcgtctttacaatca 

5 GCCCAGCCCTAGTG ATTTCAATCGTTATTTCTGCCTTGTGTGTGGG ATTATCAGCCCTC l l i i i iGCAGAATTTGCC 

TCGCGAGTACCCGCTACAGGAGGTGCCTATAGTTACCTCTATGCTATCTTAGGAGAATTCCCTGCCTGGTT 
GTTGGTTAACCATGATGGAGTTCATGACAGCCATATCAGGCGTAGCTTCGGGTTGGGCAGCTTATTTTAA 

MmFRTKWSLDKTEMHIWUeLW^ 
1 0 ATGGA YSYLY AILGEFPAWLAG WLTMMEFMTAiSGV ASGW AA YF 

ID124 1311bp

ATGAAATCAAGAGTAAAGGAAACGAGTATGGATAAAATTGTGGTrCAAGGTGGCGATAATCGTCTGGTAGGAAGC 
GTGACGATCGAGGGAGCAAAAAATGCAGTCTTACCCTTGTTGGCAGCGACTATTCTAGCAAGTGAAGGAAAGACC
GTCTTGCAGAATGTTCCGATTTTGTCCKjATGTCTTTATTATGAATCAGGTAGTTGGT 

ACTTTGATGAGGAAGCTCATCTTGTCAAGGTGGATGCTACTGGCGaCATCACTGAGGAAGCCCCTTACAAGTATG 

tcagcaagatgcgcgcctccatcgttgtattagggccaatccttgcccgtgtgggtcatgccaaggtatccatgcc 
aggtggttgtacgattggtagccgtcctattgatcttcatttgaaaggtctggaagctatggg 
cagacagctggttacatcgaagccaaggcagaacgcttgcatggtgctcatatctatatggactttccaagtgttg
gtgc^cgcagaacttgatgatggcagcgactctggctgatggggtgacagtgattgagaatgctgcgcgtgagc 
rtnar.attgttgacrtagccattctccttaatgaaatgggagccaaggtcaaaggtgctggtacagagac tat 
ccattactggtgttgagaaacttcatggtacgactcacaatgtagtccaagaccgtatcgaagcaggaacctttat 

GGTAGCTGCTGCCATGACTGGTGGTGATGTCTTGATTCGAGACGCTGTCTGGGAGCACAACCGTCCCTTGATTGCC 

aagttacttgaaatgggtgttgaagtaattgaagaagacgaaggaattcgtgttcgttctcaactagaaaatcta
aaagctgttcatgtgaaaaccttgccccacccaggatttccaacagatatgcaggctcaatt^ 
cagttgcaaaaggcgaatcaaccatggtggagacagttttcgaaaatcgtttcca aacct agaagagatgcgccg 

CATGGGCTTGCATTCTGAGATTATCCGTGATACAGCTCGTArrGTTGCTGGACAGCCTTTGCAGGGAGC^ 

ctttcaactgaccntcgtgccagtgcggccitgatttt 
aattggttcacttggatagaggttactacggtttccatgagaagttggcgcagctaggtgctaagattcagcggat

tgaggcaagtgatcaagatgaataa 

MKSRVKETSMDKIWQGGDNRLVGSVTOGAKNAVLPIX^ 
EEAHLVKVDATGDITEEAPYKYVSKMRASIWUjPIIj^ 
AKAERLHGAHIYMDFPSVGATQNLMMAATLADGVTVIENAAREPEIVDLAILLNEMGAKVKGAGTEnTITGVEKLHG

tthnvvqdrieagtfmvaaamtggdvurdavwehnrpliakllemgvevieedegirvrsqlenlkavhvktlphp 

GFPTDMQAQFTALMTVAKGESTMVETVFENRFQHLEEMRRMGL^ 
TGLVAQGETVVGKLVHLDRGYYGFHEKLAQLGAKIQRIEASDEDEZ

ID125 1101bp

ATGTTATTAGCGTCAACAGTAGCCTTGTCATTTGCCCCAGTATTGGCAAO-CAAGCAGAAGAAGTTCTTTG 

CACGTAGTGTTGAGCAAATCCAAAACGATTTGACTAAAACGGACAACAAAACAAGTTATACCGTACAGTATGGTG 

ATACTTTGAGCACCATTGCAGAAGCCrTGGGTGTAGATGTCACAGTGCTTGCGAATCTGAACAAAATCACTAATAT 

GGACTTGATTTTCCCAGAAACTGTTTTGACAACGACTGTCAATGAAGCAGAAGAAGTAACAGAAGTTGAAATC^^
AACACCTCAAGCAGACTCTAGTGAAGAAGTGACAACTGCGACAGCAGATTTGACCACTAATCAAGTGACCGTTGA 
TGATCAAACTGTTCAGGTTGCAGACCTTTCTCAACCAATTGCAGAAGTTACAAAGACAGTGATTGCTTCTGAAGAA 
GTGGCACCATCTACGGGCACrrCTGTCCCAGAGGAGCAAACGACCGAAACAACTCGCCCAGTTGCAGAAGAAGCT 
CCTCAGGAAACGACTCCAGCTGAGAAGCAGGAAACACAAACAAGCCCTCAAGCTGCATCAGCAGTGGAAGCAAC

TACAACAAGTTCAGAAGCAAAAGAAGTAGCATCATCAAATGGAGCTACAGCAGCAGTTTCTACTTATCAACCAGA

AGAAACGAAAGTAATTTCAACAACrrACGAGGCTCCAGCTGCGCCCGATTATGCrGGACTTGCACT 
TGAAAATGCAGGTCTTCAACCACAAACAGCTGCCTTTAAW 

AGTGGTTATCGTCCAGGAGACAGTGGAGATCACGGAAAAGGTTTGGCTATCGACTTTATGGTACCAGAACGTTCA 
GAATTAGGGGATAAGATTGCGGAATATGCTATTCAAAATATGGCCAGCCGTGGCATTAGTTACATCATCTGGAAA
CAACGTTTCTATGCTCCATTCGATAGCAAATATGGGCCAGCTAACACTTGGAACCCAATGCCAGACCGTGGTACT
GTGACAGAAAATCACTATGATCACGTTCACGTTTCAATGAATGGATAA 

MIXASTVAI^FAPVLATQAEEVLWTARSVEQIQNDLTKTDNKTSYTVQ 
IFPETVLTTTVNEAEEVTEVEIQTPQADSSEEVTTATADLTTNQVTVDDQ 
VPEEQTTETTRPVAEEAPQETTPAEKQETQTSPQAASAVEATTTSSEAKEVASSNGATAAVSTYQPEETKVISTT^
APDYAGLAVAKSENAGLQPQTAAFKKKIJ.TCLALHPLVVIVQETVErr^ 
VALNTrSSGNNVSMLHSIANMGQLTLGTQCQTVVVZQKITMITFTFQZMD 




ID126 1281bp

5^I G 3I? CACAOTAmG ^ GCT CATTTAGGATrGATAGCTATTTCACCGGTTTCAGTAGCTG 

T AATCC ACGTCTGCCTCTCT ATGTTAG 1 1 1 1 1 1 ATCCAATGCCTTGAATATT Lt l l riU CAAGTCTAGCTATTTTTCT 

13 ^5 A ^J G ^ AATCGGAGAAGTOTG 

^ACCCGTGGAGGCTAGTG^ACTGACA^ 
on A ^ A $5°5 A G I^ GCACGGA " ATCAAATGCACGCCTCCCT T^ATGCGACAAGTATAGGAATCT^ 

TAATGGTTTTCGCTGCrrTATITCTACGCTATCGTTACCAGCGCTATATGAGCTTGAAAGGATAG 

YHVTEALKnXLl^FLLGFLSIFAGKEMIGLLGTtRDVAESGGLYl^LVGGSrVLLGLMTSLGALIRATHNPBipLYV^Fl 
SNALNILFSSLAIFVLDMGIAGVAWGTIVSRLVGLVILWSQIJCLPYGKPTFGLDKELLTT^^
^?^ A r AGNAIGEVLT Q WYMPAFGVATA TVMLLARAVGEDD^^ 

ID127 894bp

g T g J5 g ^ g ^" ATCagagcaggtgTa ^ Ga tggaacatot 
3Z± AA ^ AG ~ CAAAGAGAAC ^ 

™ GC I A ^ GGTCTGA ^ GT ^ AAG AGATGCGAGTrATGATATGAAGCAGGATGATT^GTA^ 

a^SSgVca^ 

aoactat gtcacragattggtagagaagttatggagagggaggaattttaccaagagattact^ 

40 A ™^ mATI ™ C CCTCA^^ 

AGG ™J AGAGAGAAGA ™TO AG ^ 

I GGC ^I ATGAAAAAGGAGAG TCTAA A GAAGGCTGTAAGCAGATGCAAGAGGCCATGCA^ 
AGGTCTTCCAGAGCAAGTAGCCTATTATCAGGAACACTACGAAAAATTTGTCAAAACnTAA rnTGATGTGTT 

45 ^!!^ GVKMEHLGKVF ^ reTSCNYSLK ^ G ESCro^ 
" E ^ MAQ,,PLYYSNDWGF Q KL Q^^ 

^myeulfgnlysitovdyvtiugrevmereefyqeisiuikrlvulalnc^^^ 

laYERNVFHYLKGFALYQKGQCKEGCKQMQEAMHIFDVLGLPEQW 





IASLL3 



ID1 1068bp



ATGTCTAACATTCAAAACATGTCCCTCGAGGACATCATGGGAGAGCGCTTTGGTCGCTACTCCAAGTACATTATTC

AAGACCGGGCTTTGCCAGATATTCGTGATGGGTTGAAGCCGGTTCAGCGCCGTATTCTTTATTCTATGAATAAG

TAGCAATACTTTTGACAAGAGCTACCGTAAGTCGGCCAAGTCAGTCGGGAACATCATGGGGAATTTCCACCCACA 

CGGGGATTCTTCTATCTATGATGCCATGGTTCGTATGTCACAGAACTGGAAAAATCGTGAGATTCTAGTTGAA 

CACGGTAATAACGGTTCTATGGACGGAGATCCTCCTGCGGCTATGCGTTATACTGAGGCACGTTTGTCTGAAATTG 

CACGCTACl ill n CAGGA TATCG AGAAAAAGACAGTTCCTTTTGCATGGAACrTTGACGATACGGAGAAAGAAC 

CAACGGTCTTGCCAGCAGCCTTTCCAAACCTCTTGGTCAATGGTTCGACTGGGATTTCGGCTGGTTATGCCA 

CATTCCTCCCCATAATTTAGCTGAGGTCATAGATGCTGCAGTTTACATGATTGACCACCCAACTGCAAAGATTGAT 

aaactcatggaattcttgcctggaccagacttccctacaggggctattattcagggtcgtgatgaaatcaagaaa 
gcttatgagactgggaaagggcgcgtggttgttcgttccaagactgaaattgaaaagctaaaaggtggtaaggaa 

CAAATCGTTATTATTGAGATTCCTTATGAAATCAATAAGCCCA^^ 

ATAACAAGGTAGCTGGGATTGCTGAGGTTCGTGATGAGTCTGACCGTGATGGTCTTCGTATCGCTATCGAACTTAA 
GAAAGACGCTAATACTGAGC I I G'ITCTCAACTACTTATTTAAGTACACCGACCTACAAATCAj\CTACAACTTTAAT 
ATGGTGGCGATTGACAATTTCACACCTCGTCAGGTTGGA7TGTTCCAATCCTGTCTAGCTATATCGCTCACCGTCG 



MSNIQNMSLEDtMGERFGRYSKYIIQDRALPDIRDGLKPVQRRILYSMN^^ 

SIYDAMVRMSQ^AVKNREILVEMHGNNGSMDGDPPAAMRYTEAR1^E1AGYLLQDIEICKTVPFAW 
AAFPNLLVNGSTG1SAGYATDIPPHNLAEVIDAAVYMIDHPTAXIDKLMEFLPGPDFPTGAIIQGRDEIKKAYETGKGRV 
VVRSKTEIEKLKGGKEQIVIIEIPYEir^NLVF^CIDDVRVN^VAGlAEVRDESDRDGLRIAIELKKDAI^LVLNYLre 
YTDLQINYNFNMVAIDNFTPRQVGLFQSCLA1SLTVEKZ 



atgccgacattagaaatagcacaaaaaaaactggagttcattaagaaggcagaagaatattacaatgccttgtgt 

ACAAATATACAGTTGAGCGGAGATAAACTAj\AAGTAATTTCCGTTACTTCTGTTAACCCTGGGGAAGGA^ 
ACTACTTCCATAAATATAGCATGGTCGTTTGCGCGTGCAGGCTATAAAACTCTTTTGATCGATGGCGATACTCGAA
ATTCAGTTATGTTAGGAu i i i i i AAATCTCGTG A A AA A ATT AC AG G G CT AACAG AATTTTT ATCTG G G AC AG CTG A 
TTTATCTCACGGTTTATGTGATACAAATATTGAAAATTTATTTGTAGTTCA^ 

ACAGCCTTGTTACAAAGTAAAAATTTTAATGATATGATTGAAACATTGCGTAAATATTTTGATTA 

atacaccgcctattggaattgttattgatgcggcaattatcactcaaaagtgtgatgcgtccatcttggtaacagc 
aacaggtgaggcgaataaacgtgatatccaaaaagcgaaacaacaattaaaacaaacagggaaactgttcctag

GAGTTGTTTTAAATAAATTGGATATCTCGGTTAATAAGTATGGAGTTTACGGTTCCTATGGAAATTATGGTAAAAA 



MPTLEIAQKKLEFIKKAEEYYNALCTNIQL3GDKLKVISVTSVNPGEGKTTTSINIAWSFARAGYKTLLIDGDTRNSVM^ 

GVFKSREKrTGLTEFLSGTADLSHGLCDTNIENLFVVQSGSVSPNPTALLQSKNFNDMIETLRKYFDYinDTPPIGIVIDAA 

UTQKCDASILVTATGEANKRJDIQKAKQQLKQTGKLFLGVVLNKLDISVNKYGVYGSYGNYGKK2 



ATGGA GGCAAATATGAAACATCTAAAAACATTTTACAAAAAATGGTTTCAATTATTAGTCCrTTA 

TTTTrAGTGGAGCCn-GGGTAGTTTTrCAATAACTCAACTAACrCAA 

TAGTACTATTACACAAACTGCCTATAAGAACGAAAATTCAA 

TGTTTCTGTTATTACnTATTCGGCAAACAGACAAAATAGCGTATTTGGCAATGATGATACTGA 

cgaatctctagtgaaggatctggagttatttataaaaagaatgataaagaagcttacatcgtcaccaacaatcac 

GTTA-nTAATGGCGCCAGCAAAGTAGATATTCGATTGTCAGATGGGACTAAAGTACCT^ 
GAC^CTTTCrCrGATATTGCrGTCGTCAAAATCTCTTCAGAAAAAGTGACAACAGTAGCT^ 

GTAAGTTAACTGTAGGAGAAACTGCTATTGCCATCGGTAGCCCGTTAGGTTCTGAATATGCAAATACTGTCACTCA 
AGGTATCGTATCCAGTCTCAATAGAAATGTATCCTTAAAATCGGAAGATGGACAAG 

CCAAACrGATACTGCTATTAACCCAGGTAACTCTGGCGGCCCACTGATCAATATTCAAGGGCAGGTTATCGGAAT 

TACCrCAAGTAAAATTGCTACAAATGGAGGAACATCTGTAGAAGGTCTTGGTTTCGCAATTCCTGCAAATGATGCT 

ATCAATATTATTGAACAGTTAGAAAAAAACGGAAAAGTGACGCGTCCAGCITTGGGAATCCAGATGGTTAATTTA 

TCTAATGTGAGTACAAGCGACATCAGAAGACTCAATATTCCAAGTAATGTTACATCTGGTGTAATTGTTCXiTTCGG 

TACAAAGTAATATGCCTGCCAATGGT GACC TrGAAAAATACGATGTAATTACAAAAGTAGATGACAAAGAGATTG 

cttcatcaacagacttacaaagtgctctttacaaccattctatcggagacaccattaagataacctactatcgta^ 
cgggaaagaagaaactacctctato^acttaacaagagttcaggtgatttagaatcttaa 

MEANMKHLKTFYKKWFQLLVVIV1SFFSGAJJ3SFS^ 

rTYSANRQNSVFGNDDTDTDSQRISSEGSGVIYKKNDKEAYIVTNNHVINGASKVDIRLSDGTKVPCEIVGADTFSDlAV 
v KlSSEKVTTVAEFGDSSKLTVGETAlAJGSPLGSEYA^^^VTQ 



AGAAGTGA 



ID12 684bp 



ATAA 



ID13 1182bp 






LINIQGQVIGrrSSKIATNGGTSN^GLGFAIPANDAINIIEQI^KN^ 

RSVQSNMPANGHLEKYDVrTKVDDKElASSTDLQSALYNHSIGDTIKrryYRNGKEETTSDCLNKSSGDLESZ 
W\S ?39bp 

ATGGCAGAAATTTATCTAGCAGGTGGTTGTTTTTGGGGCCTAGAGGAATATTTTTCACGCAT^ 
AAACCAGTGTTGGCTACGCTAATGGTCAAGTCGAAACGACCAATTACCAGTTGCTCAAGGAAACAGACCATGCAG
AAACGGTCCAAGTGATTTACGATGAGAAGGAAGTGTCACTCAGAGAGATTTTACTTTATTATTTCCGAGTTATCGA 
TCCTCTATCTATCAATCAACAAGGGAATGACCGTGGTCGCCAATATCGAACTGGGATTTATTATCAGGATGAAGC 
AGATTTGCCAGCTATCTACACAGTGGTGCAGGAGCAGGAACGCATGCTGGGTCGAAAGATTGCAGTAGAAGTGGA 
GCAATTACGCCACTACATTCTGGCTGAAGACrACCACCAAGACTATCTCAGGAAGAATCCTTCAGCT^ 
^JCG ATGTG ACCG ATG CTG ATAAGCCATTGATTG ATGCAG CAAACTATG AAA AGCCTAGTCAAGAGGTGTTG AAG 

GCCAG IH ATCTGAAGAOT ^ 
i c aa *2^ga^^ 

13 AGGTTGTGGTTCGCCAAui l l 1 AGCCGTCCGATTTCCAAAGAGTTGATTCATTATTACAAGGATCTGAGCCATGGA 

TGGCTATCTATTGCCTTACTTAAACAAATAA 



MAEIYl^GGCFWGLEEYFSRISGVLETSVGYANGOVETTNYQLl^ETDHAETVQVIYDEKEVSLREILLYYFRVro 
NQQGNDRGRQYRTGIYYQDEADLPAIYTVVQEQERMLGRKIAVEVEOLRHYU^EDYHODYUOCNPSGYOTIDVTDA 
DKPLIDAANYEKPSOEVLK^LSEESYRVTOEAATEAPFTNAYDQTreEGIYVDrTTGEPLFFAKDKFASGCGW 
SKEUHYYKDUHGMEWEVRSRSGSAHLOHVFTTCPRELCCLRra 



ID17 870bp



A I^ GA "^ GTACCTGCAACCAGTGCC ^ TAT CGGGCC 

^T^^^^^^^ AGGTCXG CG AAG /^A CG AG ATG AGTG G CTG ATXG AAC A CC AG A TTG G CAAATGG ATTCC ACATG 

A £ GAGGG I A ^^ 

S^5^ AC 3^ C ^ TCAGACCATGAAA ^ 

2 A SI2T GAin ^ CTAG ^ ACA ^ 
T G ^ G I^ AAGGAAG ^ 

GA ^ G ^ GGGGA ^ GGAATCGAGGGAGAG CTCTTCCATGAGCG 

GAT^GCAAGTGACCAAAGAAAATGGGGCCTATGCAACCTACCTTTCTGGTGCTGGGCCGA^ 
GG ^^ GATGACAAGATGCCAA ^ 

GAG AGTTG ATACCCAAGGTGTCCGTGT AG AAGCAAAATAA 

5Vc^^ VLPKK ^^ AVAASS ^ NVAV ^^ 

YLSGAGPTVMVLASHDKMPTIKAELEKQPFKGKLHDLRVDTQGVRVEAKZ 
ID20 564bp

ATGAAATATCACGATTACATCTGGGATTTAG 

II GAA ^ A " GGCACTGTATGGTATCACACAAG 
SO ?-r-^^A A "^"^"^^ CT CC C A ATTT A G AG AATTTTTT A G A A A A G T A C AA G G A AAATG AAG CCAGA <^\GCnTGA 

50 A ^ G f GGA ^ 

^ GA I GGAAA I^^ 

^ A ^S AGGCmAAGAGAAAGCCAAATCC CGAATCCATG 

ID21 1875bp

^°P C ^ TG S2 TCCAGCGATGTACA ^ GGATC AACCTCAAAAGAAGGTCT^ 
65 TACTGTTarGGATGATGGCCGTGGTATCCCAGTCGATATrCAGGAAA^^ 








ctttacagtccttcacgctggaggaaagttccgcggtggtggatacaaggtttcacgtggtcttcacgggctggg

gtcgtcagtagttaatgccctttccactcaattagacgttcatgttcacaaaaatggtaagattcattaccaagaa 

taccgtcgtggtcatgttgtcgcagatcttgaaatagttggagatacggataaaag^ggaacaactgttcacttc 

acaccggacccaaaaatcttcactgaaacaacaatctttgattttgataaattaaataaacggattcaagagttg

gcctttctaaatcgcggtcttcaaatttcaattacagataagcgccaaggtttggaac 

atgaaggtgggattgctagttacgttgaatatatcaacgagaacaaggatgtaatctttgatacaccaatctata 

cagacgotgagatggatgatatcacagttgaggtagccatgcagtacacaactggttaccatgaj\aatgtcatga 

gtttcgccaataatattcatacccatgaaggtggaacacatgaacaaggtttccgtacagccttgacacgtgttat 

caacgattatgctcgtaaaaataagttactgaaagacaatgaagataatttaacaggggaagatgttcgcgaagg 

CTTAACTGCAGTTATCTCAGTTAAACACCCAAATCCACAGTTTGAAGGACAAACCAAGACCAAATTGGGAAATAG
CGAAGTGGTCAAGATTACCAATCGCCTCTTCAGTGAAGCTTTCTCCGATTTCCTCATGGAAAATCCA 

aaacgtatcgtagaaaaaggaattttggctgccaaggctcgtgtggcrgccaagcgtgcgcgtgaagtcacacgt 
aaaaaatctggtttggaaatttccaacctrccagggaaactagcagactgttcttctaataa 
aactcttcatcgtcgaaggagactcagctggtggatcagccaaatctggtcgtaaccgtgagtttcaggctatcct
tccaattcgcggtaagattttgaacgttgaaaaagcaagtatggataagattctagccaacgaagaaattcgtag

TCTTTTC^CAGCCATGGGAACAGGATTTGGCGCAGAATrrGATGrrrCGAAAGCCajr^ 

ATGACCGATGCCGATG TCGAT GGAGCCCACATTCGTACC C I 1 L I ' 1T1 ' AACCTTG ATTTATCGTTATATGAAACCAA 

TCCTAGAAGCTGCnTATGTTTATATTGCCCAACCACCAATCTATGCrrGTCAAGGTTGGAAGCGA 

TATCCAGCCGGGTGCAGATCAAGAAATCAAACTCCAAGAAGCrjTAGCCCGTTATAGTGAAGGTCGTACCAAACC 

GACTATTCAGCGTTATAAGGGGCTAGGTGAAATGGACGATCATCAGCTGTGGGAAACAACCATGGATCCCGAACA 

TCGCTTGATGGCTAGAGTITCTGTAGATGATGTGCAGAAGCAGATAAAATCTTTGATATGTTGA 

NnXEIKNLQAQDYDASQIQVLEGL^AVRMRPGMYIGSTSKEGLHHLVW 

DGRGIPVDIQEKTGRPAVETVFTVLHAGGKFGGGGYKVSGGLHGVGSSVVNALSTQLDVHVHKNGKIHYQEYRRGHV 
VADLEIVGDTDKTGTTVHFTPDPKin-ETTIFDFDKLN^ 

NENKDVlFDTPIYTDGEMDDrTVEVAMQYTTGYHENVMSFANNIHTHEGGTHEQGFRTALTRVINDYARKNKLLKDN 

EDNLTGEDVREGLTAVISVKHPNPQFEGQTKTKLGNSEVVKr™ 

RAREVTRKKSGLEISNLPGKLADCSSNNPAETELFIVEGDSAGGSAKSGRNREF 

RSLFTAMGTGFGAEFDVSKARY0KLVLMTDADVDGAHIRTLLLTLIYRYMKPILEAGYVYIAQPPIYGVKVGSEIKEY1 
QPGADQEIKLQEAl^RYSEGRTKPTIQRYKGLGEMDDHQLWETTMDPEHRLMARVSVDDVQKQIKSUCZ 



ATGAGTAGACGTTTTAAAAAATCACGTTCACAGAAAGTGAAGCGAAGTGTTAATATAGTTTTGCTGACTATTTATT
TATTCrrrAGTITGTTrrrTATTGTTCTTAATCrTTAAGTA 

CTGCGTTAGTCCTACTAGTTGCCTTGGTAGGGCTACTCTTGATTATCTATAAAAAAGCTGAA 

CTGTTGGTGTTCTCTATCCTTGTCAGCTCTGTGTCGCTCTTTGCAGTACAGCAGTTTGTTGGACT 

AAATGCGACITCTAATTACTCAGAATATTCAATCACTGTCGCTGTTTTAGCAGA 

CAACTGACGAGTGTGACAGCACCGACTGGGACTAATAATGAAAATATTCAGAAATTACTAGCTGATATCAAGTCA 

AGTCAGAATACCGATTTGACGGTCAACCAGAGTTCGTCTTACTTGGCAGCTTACAAGAGTTTGATTGCAGGGGAG

ACTAAGGCCATTGTCCTAAATAGTGTCTTTGAAAACATCATCGAGTCAGAGTATCCAGACTACGCATCGAAGATA 

A AA A AG ATTT AT ACT AAGGGATTCACT AAA AAAGT AG AAG CTCCT A AG ACGTCTAj\GAGTCAGTCTTTC AAT ATC 

TATGTTAGTGGAATTGACACCTATGGTCCTATTAGTTCGGTGTCGCGATCAGATGTCAACATCCTGATGACTGTCA 

ATCGAGATACCAAGAAAATCCTCTTGACCACAACGCCACGTGATGCCTATGTACCAATCGCAGATGGTGGAAATA 

ATCAAAAAGATAAATTGACTCATGCGGGCATTTATGGAGTrGATTCGTCCATTCACACCTTAGAAAATCTCTATGG 

AGTGGATATCAATTACTATCTGCGATTGAACrrCACirCGTTTTTGAAATrGATTGATT^ 

TTTATAATGATCAAGAATTTACTGCCCATACGAATGGAAAGTATTACCCTGCAGGCAATGTTCATCTTGATTCAGA
ACAGGCTCTCGGTTTTGTTCGTGAGCGCTACTCCCTAGCAGATGGCGATCGTGACCGCGGGCGCCATCAACAAAA 
GGTGATTGTGGCTATCCnTCAAAAATTAACGTCAACCGAAGTGCTGAAAAATTATAGTACGATCATTAATAGCTTG 
CAAGATTCTATCCAAACAAATATGCCACTTGAGACCATGATAAATTTGGTCAATGCTCAGTTAGAAAGTGGAGGG 
AATTATAAAGTAAATTCTCAAGATTTAAAAGGGACAGCTCGGATGGATCITCCTTCrrATGCAATGC^ 
ACCTCTATGTGATGGAAATAGATGATAGTAGTTTAGCTGTAGTTAAAGCAGCTATACAGGATGTGATGGAGGGTA 



MSRRFKKSRSQKVKRSVNlVLLTIYLLLVCFLLFLtFKYNII^ 

ILVSSVSLFAVQQFVGLTNRLNATSNYSEYSISVAVI^DSEIENVTQLTSVTAPTCTNNENIQKLLADIKSSQrfTDLTVNO 

SSSYLAAYKSLIAGETKAIVLNSVFENIIESEYPDYASKiKKIYTKGra 

NILMTVNRDTKKILLTTTPRDAYVPIADGGN^ 

IDVYNDQEFTAHTNGKYYPAGNVHLDSEQALGFVRERYSLADGDRDRGRHQQK^VAILQFCLTSTEVLKNYSTUNSLO 
DSIQTNMPLETMINLVNAQLESGGNYKVNSQDLKGTGRMDLPSYAMPDSNLYVMEIDDSSLAVVKAAIQDVMEGRZ



ID54 1446bp



GATGA 



ID55 732bp 






ATGATAGACATCCATTCGCATATCGTTTTTGATGTAGATGACGGTCCCAAGTCAAGAGACGAAAGCAAGGCTCTC 
TTGGCAGAATCCTACAGACAGGGGGTGCGAACCATTGTTTCTACCTCTCACCGTCGCAAGGGCATGTTTGAAACTC
CGGAAGAGAAGATAGCAGAAAACTTTCTTCAGGTTCGGGAAATAGCTAAGGAAGTGGCGAGTGACTTGGTCATTG
CTTACGGGGCTGAAATTTATTACACACCAGATGTTCTGGATAAGCTGGAAAAAAAGCGGATTCCGACCCTCAATG
D AT AGTCGTT ATG CCTTG AT AG AGTTT AGTATG A A C A CTCCTT ATCGCG AT ATTC AT AG CG CCTTG AG CAAG ATCTT 

GATGTTGGGAATTACTCCAGTCATTGCCCaCATTGAGCGCTATGATGCTCTTGAAAATAATGAAAAACGCGTTCGA 
GAACTGATCGATATGGGCTGTTACACGCAAGTAAATAGTTCACATGTCCTCAAACCCAAACTTTTTGGCGAACGTT
ATAAATTCATGAAAAAAAGAGCTCAGTATTTTTTAGAGCAGGATTTGGTTCATGTCATTGCAAGTGAT 
1 n I? AGAGGG T AGACCT ^ 

1 U GG AACTTT7TATAG ACAATCCTCG AAAAATTGTAATGGATCAACTAATTTAG 

MIDIHSHIVFDVDDCPKSREESKALLAESYRWVRTIVro 

YYTPDVLi>KLEKKRIPTLNDSRYALIEFSMOTPYRDIHSAl^KJLMLGrrPVlAHIERYDALENNE^ 
NSSHVLKPKLFGERYKFMKXRAQYFl^QDLVHVlASDMHN^ 



ID58 3990bp

II!^IZ T ^ A J AATCGCTATCAATATA 

TTTTCrATTCGTAAATACGCTCTAGGAGCAGCTTCTGTTCTAATTGGATTTGCCn^ 
CCGATGGAGTTACTCCTACTACTACAGAAAACCAACCGACCATCCATACGGTTTCTGATTCCCCTO^TCATCCGA 
AAATCGGACTGAGGAAACACCTAAAGCAGTGCrrCAACCAGAAGCTCCAAAAACTGTAGAAACAGAAACTCCAG 
CTACTGATAAGGTAGCTAGTCTTCCAAAAACAGAAGAmAAAACCACAAGAGGAAGTTAGTTCAACTCCTAGTGATA 

aagcagaagtggtaactccaacttctgctgaaaaagaaactgctaataaaaaggcagaagaagct 
AAGGaag ^ gcg ^agaggttgattctaaagagtcaaatacagacaagactgacaaggataaaccagct 
agatgaagcgaaagcagaggct^caaaccgg<^cagaggcaggaaaggaacgtgctgcaactgtaaatga^ 
aactagcgaaaaagaaaattgtttctattgatgctggacgtaaatatttctcaccagaacagctcaa 
tcgataaagcgaaacattatggctacactgatttacacctattagtcggaaatgatggactccgtttcatc^ 
-in cga ™ tgagcatcacagg taacggcaagac^ 

TGATTATTACAACGATCCAAACGGCAATCACTTAACAGAAAGTCAAATGACAGATCTGATTAACTATGCCAAAGA

£^ G £^ 

2 GGAA I£^ 

TGTCGCTTTTACAAAAGCCCTTATCGACAAGTATGCTGCTTATTTCGCGAAAAAGACTGAAATCTTCAACATCGGA
CTTGATGAATATGCCAATGATGCGACAGATGCTAAAGGTTGGAGTGTGCTrCAAGCTGATAAATACTATCCAAAC 
GAAGGCTACCCTGTAAAAGGCTATGAAAAATTTATTGCCTACGCCAATGACCTCGCTCGTATTGTAAAATC^ 

?2I^ GA ^ CAATGGC1 ^ AACGACGGTATCTA 
A l! GATCG I3I£ rATGTGGACTGGTGG " GGGGA 
AGGAAA T G <™ ATAC ^ 
GGA I9 AGGGGCTCAATGOT ^ 

CATCGGT^TATGGTAGCTGClTGGGCTGACACTCCATCTGCACGTrATTCACCATCACGCCTCTTCAAACTCATG 
CGTCATTTTGCAAATGCCAACGCTGAATACrrCGCAGCTGATTATGAATCT 

CCAAAAGACCTGAACCGTTATACTGCAGAAAGCGTCACGGCCGTAAAAGAAGCTGAAAAAGCTATTCGCTCTCTC 
GATAGCAACCTTAGCCGTGCCCAACAAGATACGATTGATCAAGCCATTGCTAAACTTCAAGAAACTGTCAACAAC 

SO ^^^^ AA S AGTCCAGGTCA ^ TGGAT GCTATGCTGGTTGCCATGGAAAAATTAGGTA^ 

GCCCACTTTGATAAAGTTTCAAAAACAACTATGGACTTGAAAAACGAAGAAGCGATGAACTTTGTAAAAC^CCTC 

atcggtaaatacatggacttctttgcaggtaaaacaaagattttcaactttggtactgacgaatacgccaacgat 

GCGACTAGTGCCCAAGGCTGGTACTACCTCAAGTGGTATCAACT^^ 




CAAGAAGAAGAAATTCCATTTACCACAGTGA 

ACTAAGGGCGTCAATGGACATCGTAGCA-ACTTCTACTCTGTGAGCACTTCTGCCGATGCT 

CTTGTAAATAGTGTCGTAGCACAGGAAGCCGTTACTCAAATAGTCGAAGTCGGAACTATGGTAACACATGTAGGC 

gatgaaaacggacaagccgctattgctgaagaaaaaccaaaactagaaatcccaagccaaccagctccatcaac 
5 tgctcctgctgaggaaagcaaao 1 I LT1 cctcaagatccagctcctgtggtaacagagaaaaaacttcctgaaac 

aggaactcacgattctgcaggactagtagtcgcaggactcatgtccacactagcagcctatggactcactaaaag 
aaaagaagactaa 

MrYTlAlMTMQSGGFAMl^EKQQ 

TPKAVLQPEAPKTVETETPATDKVASLPKTEEKPQEEVSSTPSDKAEVVTPTSAEKET
SNTDKTT>KDKPAKKDEAKAEADKPATEAGKERAA7VNEK1^^ 
VGNIXiLRFMLDDMSrrANGKTYASDDVKRArEKGTN^ 

AILNAMKELGIQNPNFSYFGKKSARTVDLDNEQAVAFTKALIDKYAAYF^ 
DKYYPNEGYPVKGYEKFIAYANDI^RJVKSHGLKPM 

EKGHQILOTNDAWYYVLGRNADGQGWYNLDQGLNGIKNnrPrrSVPKTXGADIPnGGM
MRHFANANAEYFAADYESAEQALNEVPKDLNRYTAESVTAVKEAEKAIRSLD^ 
LTPEAQKEEEAKREVEKLAKNKVISIDAGRKY^ 

SDDVKKAIIEGTXAYYDDPNGTAl-TQAEVTELIEYAiCSKDIGLIPAINSPGHMDAMLVAMEKLGIKNPQ 
MDLKNERAMNFVKALIGKYMDFFAGKTKIFNFGTDEYANDAT3AQGWYYLKWYQL 
QPhlAFNDGFYYEDKDDVQFDKDVLISYWSKGWWGYNLASPQYLA^KGYKFLKTNGDWYYILGQKPE
EOTGICTPFNQLASTKYPEVDLPTVGSMI^rWADRPSAEYKEEEIFELNTTAFADHNKD 

YSKESLEAU5AAKTALNYNLNRiNnCQAELDTLVANLKAALQGLKPAVTHSGSLDENEVAANVETiy>ELrrRTEElPFEVI 
KKENPNLPACQENirTAGVKGERTHYlSVLTENGKTT^ 
DIQEEEIPFTTVTCENPLLLKGKTQVrTKGV^^ 
NGQAAIAEEKPKLEIPSQPAPSTAPAEESKVLPQDPAPVVTEKKLPETC^



ID122 825bp 

ATGAACAAAAAAACAAGACAGACACTAATCGGACTGCTAGTGTTATTGCTTTTGTCTACAGGGAGCTATTA 
AAGCAGATGCCGTCGGCACCTAATAGTCCCAAAACCAATCTTAGTCAGAAAAAACAAGCGTCTGAAGCT^ 

C ^ GC ^II2? CAGAGAOTOT< ^^ CAGACGCAOT 

AGGTGCnTTTATCGTCAATGGTAATAAAACAAATCTAGATGCCAAGGTTTCAAGTAAGCCCTACGCTGACAATAA 
AACAAAGACAGTGGGCAAGGAAACTGTTCCAACCGTAGCTAATGCCCTCTTGTCTAAGGCCACTCGTCAGTACAA 
« GA^OTA^GAAACTGGGAATGGTTCAACTTTCTTGGACTCCT^ 

CTCTTATACCCATGCAGTCGATAGAGGTCATTTGTTAGGCTATGCCTTAATCGGTGGTTTGGATGGTTTTGATGC

CAACAAGCAATCCTAAAAACATTGCTGTTCAGACAGCCTGGGCAAATCAGGCACAAGCCGAGTATTCGACTGGTC 
AAAACTACTATGAAAGCAAGGTGCGTAAAGCCTTGGACCAAAACAAGCGTGTCCGTTACCGTGTAACCCTTTACT
ACGCTTCAAACGAGGATTTAGTTCCCTCAGCTTCACAGATTGAAGCCAAGTCTTCGGATGGAGAATTGGAATTCA 

atgttctagttcccaatgttcaaaagggacttcaactggattaccgaactggagaagtaactgtaact 



MNKKTRQTLIGLLVLLIXSTGSYYKOMPSAPNSPKTNI^QKKQASEAPSOALAESVLTDAVKSQIKGSI^W 
NGNKTNLDAKVSSKPYADNKTKTVGKETVPTVAN ALLS KATRQYXNRKETGNGSTSWTPPG WHO 

DRGHLLGYALIGGLDGFDASTSNPKNIAVQTAWANQAQAEYSTGQNYYESKVRKALDQNKRVRYRVTLYYASNEDL 
VPSASQIEAKSSDGELEFNVLVPNVQKGLQLDYRTGEVTVTQZ

ID123 225bp

GTGCTAAGATTCAGCGGATTG^^GCAAGTGATGAAGATGAATAAGAAATCAAGCTACGTAGTCAAGC^^ 
TTAGTCATCATAGTACTGATTTTAGGTACTCTGGCTCTAGGAATCGG7TTAATGGTAGGTTATGGAATC^ 
AGGGTCAAGATCCATGGGCTATCCTCTCTCCAGCAAAATGGCAGGAATTGATTCATAAATTTACAGGAAATTAG 

VLRFSGLRQVMKMNKKSSYVVKRLLXVIIVLILGTLALGIGLMVGYGILGKGQDPWAILSPAKWQELIHKFTGNZ



